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EDITORIAL 


A quantum future awaits 


century ago, the quantum revolution quietly be- 
gan to change our lives. A deeper understanding 
of the behavior of matter and light at atomic and 
subatomic scales sparked a new field of science 
that would vastly change the world’s technol- 
ogy landscape. Today, we rely upon the science of 
quantum mechanics for applications ranging from 
the Global Positioning System to magnetic resonance im- 
aging to the transistor. The advent of quantum comput- 
ers presages yet another new 
chapter in this story that will 
enable us to not only predict 
and improve chemical reac- 
tions and new materials and 
their properties, for example, 
but also to provide insights into 
the emergence of spacetime 
and our universe. Remarkably, 
these advances may begin to be 
realized in a few years. 

From initial steps in the 
1980s to today, science and 
defense agencies around the 
world have supported the ba- 
sic research in quantum infor- 
mation science that enables 
advanced sensing, communi- 
cation, and computational sys- 
tems. Recent improvements in 
device performance and quan- 
tum bit (“qubit”) approaches 
show the possibility of moder- 
ate-scale quantum computers 
in the near future. This prog- 
ress has focused the scientific 
community on, and engendered substantial new indus- 
trial investment for, developing machines that produce 
answers we cannot simulate even with the world’s fastest 
supercomputer (currently the Summit supercomputer 
at the U.S. Department of Energy’s Oak Ridge National 
Laboratory in Tennessee). 

Achieving such quantum computational supremacy is 
a natural first goal. It turns out, however, that devising 
a classical computer to approximate quantum systems is 
sometimes good enough for the purposes of solving cer- 
tain problems. Furthermore, most quantum devices have 
errors and produce correct results with a decreasing 
probability as problems become more complicated. Only 
with substantial math from quantum complexity theory 
can we actually separate “stupendously hard” problems 
to solve from just “really hard” ones. This separation of 


“Achieving such quantum 
computational supremacy is 
a natural first goal.” 


classical and quantum computation is typically described 
as approaching quantum supremacy. A device that dem- 
onstrates a separation may rightly deserve to be called 
the world’s first quantum computer and will represent a 
leap forward for theoretical computer science and even 
for our understanding of the universe. 

Once a real quantum computer is realized, what’s 
next? In the coming decade, we can expect that some 
problem-solving will be optimized much more rapidly 
using quantum devices. We 
can also expect that efficient 
sampling from a probability 
distribution—the theoretical 
version of a machine learning 
algorithm—will become a place 
where quantum computers can 
shine. In the longer term, error 
correction and factoring may 
change the landscape further. 

However, the lowest hang- 
ing fruit will be improving 
the ability to work with quan- 
tum mechanics. In the past, 
knowledge of quantum me- 
chanics has been refined by 
comparing classical compu- 
tational techniques to what 
has been observed by experi- 
ments—from solving differen- 
tial equations to brute force 
simulation to new approxi- 
mation methods in chemis- 
try and materials science. If 
quantum computational su- 
premacy is achieved, we may 
be able to test new techniques without requiring such 
comparison. This will reduce the cycle of research and 
transform how science is conducted. 

Aiming toward these outcomes, the White House Office 
of Science and Technology Policy has established a new 
interagency group that is tasked with creating a national 
strategy to nurture a full quantum ecosystem through co- 
ordinated research between government, academia, and 
industry. This will include engagement across community 
boundaries and between disciplines to ensure a strong, 
quantum-smart future workforce. Matching this with 
similar efforts worldwide should allow us to catch glim- 
mers of new scientific horizons and to develop the po- 
tential industries and new technologies that may emerge 
from investing in quantum information science. 

-Jacob M. Taylor 
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CONSERVATION 


"Trump 

to overhaul 
species 
protections - 


The Endangered Species Act covers the woodland caribou (Rangifer tarandus caribou). 


resident Donald Trump’s administration announced last week it 

plans major changes to how federal agencies interpret and apply 

the Endangered Species Act. One change proposed on 25 July 

by the U.S. Fish and Wildlife Service and the National Oceanic 

and Atmospheric Administration would mean that, in the future, 

species designated as threatened could receive less protection 
than those listed as endangered. The agencies also want to reduce how 
far into the future they must look when evaluating extinction risk and 
to have the flexibility to shrink the area of so-called critical habitat a 
species might need to survive. In addition, the agencies plan to begin 
to describe the financial costs of protecting certain species, although 
they will continue to base determinations to protect them solely on 
biological considerations. Industry groups are generally welcoming the 
revisions, which the executive branch expects to adopt administratively 
later this year after a public comment period; conservation groups warn 
the changes will undermine efforts to save many species. 
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Portion of the world’s oceans that remains as wilderness, 
minimally affected by humans. Most is in polar regions 
and around remote Pacific Island nations—and less than 
5% is protected (Current Biology). 


Hungary takes academy’s funding 


SCIENCE POLICY | Hungary’s budget for 
2019, which its Parliament approved on 

20 July, transfers two-thirds of the Budapest- 
based Hungarian Academy of Sciences’s 
(MTAs) €124 million budget to the Ministry 
for Innovation and Technology (ITM). The 
government says this will improve efficiency, 
but many scientists in Hungary and abroad 
see the move as a power grab by an increas- 
ingly authoritarian regime that will extend 
political influence over science spending and 
research agendas. “We accept the outcome, 
but we expect to continue negotiating with 
[ITM] Minister Laszlo Palkovics to see what 
the exact specifics of this new situation will 
be,’ says Tamas Simon, an MTA spokes- 
person. MTA President Laszl6 Lovasz says 
Palkovics has not supported basic, “bottom- 
up” research proposals, which Lovasz calls 
necessary to produce cutting-edge science. 


Viagra study halted over risks 


BIOMEDICINE | A Dutch study testing 
whether administering sildenafil, better 
known as Viagra, to pregnant women 
can help poorly growing fetuses was 
halted on 19 July after evidence emerged 
that the drug may cause a potentially 
fatal complication in the newborns. Fetal 
growth restriction, which can lead to 
stillbirth and neonatal death, is caused 
by a lack of blood flow from the placenta 
to the fetus. Sildenafil dilates these blood 
vessels, which smaller clinical studies 
had suggested might benefit the fetus. 
But an interim analysis of the new study 
at Amsterdam University Medical Center 
found that pregnant women who took 
the drug had a higher risk of losing their 
babies after birth because the newborns 
experienced lung hypertension—11 in the 
treatment group died of this cause. Similar 
studies in three other countries found that 
sildenafil had no benefit, but did no harm 
either; a study underway in Canada has 
been put on hold based on the new data. 


Fight brews over census choice 


STATISTICS | Ifthere’s a fight over Steven 
Dillingham, whom President Donald Trump 
nominated on 18 July to lead the U.S. 
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Census Bureau, it’s likely to focus on 

the administration’s commitment to an 
accurate count in 2020, not his experience 
leading two smaller federal statistical agen- 
cies or his multiple degrees. Secretary of 
Commerce Wilbur Ross has been sued for 
adding a citizenship question to the 2020 
census, and there’s continuing concern over 
whether the agency can pull off the first- 
ever head count using internet responses. 
“Steve is a tip-top scholar, a progressive 
administrator, and a very ethical guy,” 
says Geoff Alpert, a professor of criminology 
at the University of South Carolina in 
Columbia, who worked under Dillingham at 
the Bureau of Justice Statistics in the early 
1990s. But Representative Carolyn Maloney 
(D-NY) wants Dillingham “to reject the 
Administration’s attempt to add a citizen- 
ship question” before any Senate vote. 


The real-life Lorax? 


POPULAR CULTURE | In Theodor “Dr. Seuss’ 
Geisel’s beloved picture book The Loraz, the 
titular character tries to defend his forested 
home from an insatiable businessman—a 
lasting allegory for environmental pro- 
tection. This week, scientists report they 
have identified a species of monkey that 
may have inspired the Lorax’s implike 
features—and in a twist the Lorax himself 
might lament, they note that environmen- 
tal changes have reduced the monkey’s 
range by half since 1991. Scientists report 
this week in Nature Ecology & Evolution 
that Geisel may have based the Lorax on 
the patas monkey (Erythrocebus patas). 
Although no evidence directly links the 
fictional and real creatures, historical docu- 
ments show that Geisel, who published the 
book in 1971 and died in 1991, was writing it 
while vacationing in Nanyuki, Kenya. There 
he could have encountered the animal and 
its main food source, the whistling thorn 
acacia (Acacia drepanolobium), which may 
have inspired the book’s “truffula” trees. 

To make a stronger case, the researchers 
conducted computer-aided facial analysis 


? 


Patas monkey populations in Kenya are at risk. 
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PUBLIC HEALTH 


Progress lags in HIV/AIDS fight 


he world is behind on meeting an ambitious United Nations target of ending AIDS 

as a public health threat by 2030. To accomplish this goal, set in 2015 by the Joint 

United Nations Programme on HIV/AIDS (UNAIDS) in Geneva, Switzerland, new infec- 

tions must drop to 90% of their 2010 levels. In a report titled Miles to Go and released 

on the eve of this week's International AIDS Conference in Amsterdam, UNAIDS 
Executive Director Michel Sidibé warns that “complacency” is setting in. To meet the 2030 
target, new infections must decline by 75% between 2010 and 2020. But new infections 
fell only 18% between 2010 and 2017. Sidibé stresses that “action now can still put us back 
on course to reach the 2020 targets,” adding, “This report is a wake-up call.” 
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of several species of Kenyan monkeys and 
found strong similarities between the Lorax 
and the patas monkey. Kenyan patas popu- 
lations are losing the acacias to drought and 
land clearance for agriculture. 


Sloppy science more widespread 


STATISTICS | Several studies have shown 
that many psychology researchers admit 

to questionable practices in analyzing and 
reporting data—a phenomenon that helps 
explain why so many of the field’s stud- 

ies have proved impossible to reproduce. 
But ecologists and evolutionary scientists 
aren’t behaving much better, according to 

a study published on 16 July in PLOS ONE 
by Hannah Fraser of the University of 
Melbourne in Australia and colleagues. 
Among 807 researchers surveyed in those 
disciplines, 64% reported they had at least 
once failed to report outcomes that were 
statistically insignificant; 42% had extended 
a study to collect more data and reach sta- 
tistical significance, a form of “p-hacking”; 
and 51% had reported an unexpected finding 
as if it had been hypothesized from the 
beginning. The rates resemble those found 
in psychology; remedies such as education, 
changing the incentive structure in science, 
and preregistration of studies could help 
bring them down, the authors write. 
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Recent geological age named 


NATURAL HISTORY | Scientists last 

week announced a new name for the past 
4200 years of geologic history: the 
Meghalayan age. The declaration by the 
International Commission on Stratigraphy 
dividesthe Holocene, our current, warm, 
human-dominated epoch, into three ages 
based on global changes: the Greenlandian, 
which started as glaciers retreated 

11,700 years ago; the Northgrippian, marked 
by a cooling trend 8300 years ago; and 

the Meghalayan, which began 4200 years 
ago with a mysterious drought and cooling 
trend that coincided with the decline 

of several civilizations. The Meghalayan’s 
marker in the rock record—called the 
“golden spike’—comes from oxygen isotopes 
found in a stalagmite from the Meghalaya 
plateau of northeast India. The newly 
defined age names and dates provide stan- 
dardized versions of divisions that scholars 
had long drawn between an early, middle, 
and late Holocene. The commission has not 
finished a study of whether the world, by 
the 1950s, left the Holocene and entered a 
new geological epoch, the Anthropocene, 
dominated by human influence. 
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Report details harassment by famed biologist 


Allegations that felled Francisco Ayala divide scientists, university community 


By Meredith Wadman 


he investigative report that earlier 

this month triggered the ouster of 

prominent evolutionary geneticist 

Francisco Ayala from the Univer- 

sity of California (UC), Irvine, con- 

cluded he had a pattern of harassing 
behavior toward women. The document 
concludes that he told a pregnant colleague 
“youre so huge” and repeatedly put his 
hands under a female administrator’s jacket, 
rubbing them up and down her sides. Ac- 
cording to the report, Ayala told a female 
professor that she had been so animated 
while giving a talk that he thought she 
would “have an orgasm.” In another in- 
stance, the report states, he invited a junior 
professor in a crowded meeting to sit on his 
lap, saying he would enjoy the presentation 
more that way. 

The 97-page report, completed in May 
and obtained by Science (currently available 
at http://bit.ly/AyalaReport), describes evi- 
dence of unwelcome conduct by Ayala that 
continued after he was warned to stop. The 
internal university investigation, prompted 
by complaints submitted by four UC Irvine 
women, detailed off-color remarks and re- 
peated unsolicited compliments on wom- 
en’s physical appearances; these behaviors 
often were witnessed by one or more of the 
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61 people interviewed for the investiga- 
tion. The investigators said the complain- 
ants felt professionally undermined by his 
conduct and they concluded that Ayala, 84, 
violated UC Irvine’s sexual harassment and 
sex discrimination policies in the cases of 
three of the four women who lodged com- 
plaints. The university accepted Ayala’s 
resignation, effective 1 July, and plans to 
strip his name from its science library and 
biology building. 

In responses included in the report, Ayala 
strenuously denies most of the allegations. 
He told investigators that the entire com- 
plaint of Department of Ecology and Evolu- 
tionary Biology (EEB) Professor and Chair 
Kathleen Treseder, who reported the “huge” 
and orgasm comments, “was a lie.” 

“I saw my compliments as courtesies. 
And they turned those courtesies into sex- 
ual harassment,’ Ayala told Science in an 
interview last week. “I have never intention- 
ally caused sexual harassment to anybody,’ 
he also wrote in an email to UC Irvine 
Chancellor Howard Gillman days after the 
probe was launched. “To the extent that my 
actions may have caused harm to others ... I 
apologize from the deepest of my heart and 
of my mind.” 

Three of the women who lodged com- 
plaints directed questions to their lawyer, 
who did not respond to repeated requests 
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for comment. Science was unable to reach 
the fourth complainant. 

Ayala’s firing has drawn praise from 
some scientists and sharp criticism from 
others on the UC Irvine campus and in his 
native Spain. Critics questioned whether he 
received due process and complained that 
the university had not detailed the charges 
against him. But the report’s contents seem 
only to have deepened the division. 

“T’m just shocked that this man’s life was 
ruined over this collection of reactions to 
his behavior,’ says Elizabeth Loftus, a UC 
Irvine experimental psychologist who has 
vocally supported Ayala. After reading the 
report, she said the “thin” allegations “are 
far, far from the obviously bad behaviors 
that we want to be punishing. I feel like: 
‘Who’s next?’” 

But Jane Zelikova, an ecologist at the 
University of Wyoming in Laramie and a 
founder of the international advocacy group 
500 Women Scientists, says: “No form of ha- 
rassment is OK. ... He could have corrected 
his behavior. He did not. Being fired for do- 
ing something that is illegal is justice.” 

Ann Olivarius, a senior attorney at McAI- 
lister Olivarius in Saratoga Springs, New 
York, who specializes in sexual harassment, 
reviewed the UC Irvine report at Science’s 
request. “Unlike many harassers who have 
sex with students or pressure them directly 
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In the wake of a harassment report, Francisco Ayala 
resigned from the University of California, Irvine. 


for sex, Ayala did not cross those bound- 
aries,” she notes. “But he clearly made 
multiple women feel degraded. ... Senior 
university officials warned him to stop act- 
ing in these ways, but he continued.” 

Robert Cook-Deegan, a historian of sci- 
ence with Arizona State University who is 
based in Washington, D.C., and also read the 
report, concluded that “Norms are changing 
really fast and I think this 84-year-old got 
caught in a norm shift.” 

Ayala, who was elected to the National 
Academy of Sciences in 1980, did pioneer- 
ing work in evolution and genetics and was 
a prominent public defender of evolution. 
In 1995, he was president of AAAS, which 
publishes Science. Since 2010, he has do- 
nated more than $11 million to UC Irvine. 

Ayala admitted to making the “sit in my 
lap” comment to complainant Jessica Pratt, 
an assistant teaching professor, in 2015, 
as she prepared to present at a crowded 
faculty meeting. Ayala told investigators 
it was a one-time lapse showing “a hor- 
rendous lack of judgment.” (A graduate 
student who was interviewed as a witness 
described a separate instance in which she 


said Ayala invited her, too, to sit in his lap.) 

Pratt complained to the 
then-EEB department chair, 
prompting Ayala to visit 
Pratt in her office, accord- 


“Dr. Ayala’s very 


felt she had to put up with Ayala’s atten- 
tions because of his power as a major do- 
nor. “I just learned that women don’t like 
to be told they’re beautiful, but I know you 
don’t mind,” Shipley recalled Ayala saying 
in 2016, rubbing her sides while kissing her 
cheeks in greeting—a behavior that occurred 
repeatedly, she said. A male professor noted 
this encounter and asked Shipley afterward 
whether she was all right, the report says. 

Shipley told investigators she was relieved 
when Ayala’s attention shifted to Treseder, 
to whom she said Ayala was “glued” at a 
department social event not long after. Dis- 
tressed, Treseder asked a male colleague, 
who corroborated her claim to investigators, 
to attend such events with her. 

In early November 2017, just before 
the complaints against Ayala were filed, 
Treseder, who had recently been named the 
department chair, proposed a code of con- 
duct concerning sexual harassment at a fac- 
ulty meeting. Ayala pushed back, according 
to the report, saying the “ladies” in the dean’s 
office wanted him to hug and kiss them. 

In the fourth complaint, graduate stu- 
dent Michelle Herrera alleged that Ayala 
put his hands on her bare shoulders—a 
behavior Ayala admitted to—and leaned 
his front against her back as she sat at a 
picnic table. Ayala vigorously denied lean- 
ing against her. Investiga- 
tors concluded that the 
incident probably occurred 
but might not have been 


ing to the report. Ayala told public punishment gender-based. They found 
investigators that he “apol- ° Herrera’s claims that Ayala 
ogized profusely” to Pratt, will send a loud repeatedly commented on 
telling her he intended signal that times her appearance not cred- 
the comment to be play- é 7 ible. Herrera could not be 
ful, like he was addressing 4 changing eee reached for comment. 


a niece or granddaughter. 
But when Pratt told Ayala 
that the comment was 
overheard—something other witnesses cor- 
roborated—Ayala called her a liar, accord- 
ing to the report. Pratt lodged an informal 
complaint with UC Irvine’s Office of Equal 
Opportunity and Diversity. She told inves- 
tigators that Ayala’s conduct “made her 
question whether he respected her work. 
.. She even began to question her own 
merit as a scientist.” 

As a result of Pratt’s complaint, Associ- 
ate Chancellor Kirsten Quanbeck warned 
Ayala in 2015 to watch his language with 
women and told him that his conduct was 
viewed as unwelcome and was out of line 
with university policy. The EEB depart- 
ment chair gave him a similar warning, the 
report says. 

One complainant, Benedicte Shipley, 
an assistant dean in UC Irvine’s School of 
Biological Sciences, told investigators she 


Ann Olivarius, 
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UC Irvine’s investigators 
also assert that Ayala “has 
engaged in a campaign with 
the highest University officials to influence 
the outcome of this investigation.” The re- 
port says he wrote to Gillman and to Janet 
Napolitano, president of the UC system, re- 
minding them of his financial and academic 
contributions to UC Irvine. According to the 
report, Ayala told the senior investigator as 
the probe launched that it “needed to end 
quickly and in his favor and [that] he had 
lawyers waiting if [it] did not.” 

Speaking with Science last week, Ayala said 
he had wanted to avoid a protracted legal 
struggle so he could focus on his science. He 
added: “I didn’t say anything about lawyers.” 

Olivarius predicts the Ayala case will 
have an impact beyond UC Irvine. “Dr. 
Ayala’s very public punishment will send a 
loud signal that times are changing—that 
harassment ... does not mean just extreme 
misconduct,” she says. 
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NUCLEAR PHYSICS 


Electron- 

ion collider 
wins key 
endorsement 


National Academies report 
calls for accelerator to look 
inside protons and neutrons 


By Adrian Cho 


he next dream machine for U.S. 
nuclear physicists got an important 
boost this week in a report from 
the National Academies of Sciences, 
Engineering, and Medicine. The re- 
port committee glowingly approved 
of the science that could be done with the 
proposed Electron-Ion Collider (EIC), a 
billion-dollar accelerator that would probe 
the innards of protons and neutrons. The 
endorsement should help the Department 
of Energy (DOE) justify building the EIC at 
one of two national laboratories competing 
to host it, although the project probably 
won't get the go-ahead for several years. 

“We're basically saying, ‘You’ve really 
got to do this,” says Ani Aprahamian, a 
nuclear physicist at the University of Notre 
Dame in South Bend, Indiana, and co-chair 
of the report committee. 

The inner structure of the proton and 
the neutron remains mysterious. Crudely, a 
proton consists of three subatomic particles 
called quarks, bound by the strong nuclear 
force. In actuality, a proton is far more com- 
plex. Because of the uncertainties inherent 
in quantum mechanics, its interior roils 
with quark-antiquark pairs popping in and 
out of virtual existence. It also teems with 
gluons, the quantum particles that convey 
the strong force. The mess is so complex 
that even basic properties of the proton 
remain unexplained. For example, its three 
quarks account for less than 5% of its mass, 
the rest arising somehow from energy of the 
virtual quarks and gluons. 

By blasting a beam of electrons into a 
beam of protons or ions, the EIC would 
help solve this mystery and a parallel one: 
how the proton gets its spin. Just as in the 
case of mass, the proton’s spin is not simply 
the sum of the spins of the three quarks; it 
also has unknown contributions from glu- 
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ons and from the quarks orbiting around 
one another. Finally, the EIC could probe 
the gluons’ behavior for so-called emer- 
gent properties. For example, some theories 
predict that a proton’s gluons crowd into a 
single quantum wave a bit like laser light. 
The EIC would be better for such studies 
than a machine that smashes protons into 
protons, such as the Large Hadron Collider 
(LHC) near Geneva, Switzerland. That’s 
because the electron is an infinitesimally 
small particle and produces cleaner, easier 
to interpret collisions, Aprahamian says. 

The EIC would help maintain U.S. exper- 
tise in colliding beams, says Gordon Baym, 
a theorist at the University of Illinois in Ur- 
bana and co-chair of the report committee. 
“Tt’s the only collider in the U.S. that’s being 
considered for the next 50 years or so,” he 
says. Now, the United States only has the 
Relativistic Heavy Ion Collider (RHIC) at 
Brookhaven National Laboratory in Upton, 
New York, which was completed in 1999. 
Europe has the newer LHC and Japan runs 
the newly upgraded SuperKEKB electron- 
positron collider in Tsukuba. 

Physicists at two DOE labs propose to 
assemble the EIC in different ways. At 
Brookhaven, RHIC smashes together heavy 
nuclei such as gold to melt them and cre- 
ate a soup called a quark-gluon plasma, the 
stuff that filled the universe just after the 
big bang. Brookhaven physicists eventually 
want to add an electron accelerator to con- 
vert RHIC into the EIC. 

However, the Thomas Jefferson National 
Accelerator Facility in Newport News, Vir- 
ginia, has recently upgraded its Continu- 
ous Electron Beam Accelerator Facility 
(CEBAF), which fires electrons into sta- 
tionary targets to study protons, neutrons, 
and nuclei. Jefferson Lab researchers hope 


to add an ion accelerator to it to make the 
EIC. Jefferson Lab is a smaller facility that’s 
more narrowly focused on nuclear physics 
than Brookhaven, and its long-term survival 
could depend on landing the EIC. 

The report avoids comparing the two 
labs’ proposals, but offers DOE officials 
a scientific case for the EIC. However, on 
top of running RHIC and CEBAF, DOE’s 
$684 million office of nuclear physics is also 
building the $730 million Facility for Rare 
Isotope Beams at Michigan State University 
in East Lansing, which upon completion 
in 2020 will generate exotic nuclei. Given 
the costs of operating those facilities, DOE 
probably can’t afford the EIC any time soon, 
notes Donald Geesaman, a nuclear physicist 
at Argonne National Laboratory in Lemont, 
Illinois, and former chair of DOE’s Nuclear 
Science Advisory Committee. 

More time may be welcomed, anyway, as 
the Brookhaven and Jefferson Lab propos- 
als cannot yet meet key technical require- 
ments. From 1992 to 2007, physicists in 
Germany ran the Hadron-Electron Ring 
Accelerator (HERA), which also collided 
electrons and protons and revealed the 
gluon. The EIC will run at lower energy 
than HERA did, but to achieve its goals, 
it will have to generate collisions at a rate 
100 to 1000 times higher with highly polar- 
ized electron and proton beams. 

For now, Brookhaven and Jefferson Lab 
scientists are collaborating on the R&D 
rather than competing. “It’s been very excit- 
ing to see the community self-assemble,” says 
Jefferson Lab Director Stuart Henderson. 
Brookhaven Director Doon Gibbs says, “The 
immediate objective is to keep walking down 
the road together with both labs in lockstep.” 
Tf all goes well, however, the two labs’ ambi- 
tions will eventually collide. & 
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The main accelerator at Brookhaven National Laboratory could be repurposed into an electron-ion collider. 
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SCIENTIFIC ETHICS 


Study of 

‘sea nomads’ 
under fire 

in Indonesia 


Critics say study lacked 
local ethical approval and 
role for Indonesian experts 


By Dyna Rochmyaningsih 


n April, a paper showing why Indonesia’s 
Bajau people are such great divers drew 
worldwide attention as a striking exam- 
ple of recent human evolution (Science, 
20 April, p. 244). But the study, published 
in Cell, has created a different kind of stir 
in Indonesia, where some say it is an exam- 
ple of “helicopter research” carried out by 
scientists from rich countries with little con- 
sideration for local regulations and needs. 

“Too many mistakes were made here,” says 
geneticist Herawati Sudoyo, who heads the 
Eijkman Institute for Molecular Biology in 
Jakarta. Indonesian officials say the research 
team failed to obtain ethical approval from 
a local review board and took DNA samples 
out of the country without the proper paper- 
work. And some Indonesian scientists com- 
plain that the only local researcher involved 
in the study had no expertise in evolution or 
genetics. But Eske Willerslev, director of the 
University of Copenhagen’s (KU’s) Centre for 
GeoGenetics, says the team he headed had a 
permit from the Indonesian government and 
worked hard to follow the rules. “I would 
never participate in research that I felt was 
unethical,” Willerslev says. The government 
hasn’t informed him about problems, he 
says, but, “If we have made an error that vi- 
olates national or international guidelines, 
we would like to apologize for that.” 

The issue escalated in late May, when 
Pradiptajati Kusuma, a geneticist at the 
Eijkman Institute who has also studied the 
Bajau, suggested in a tweet that the team 
could have faced prosecution under strict 
new rules on foreign research, proposed by 
the Indonesian government and now under 
debate. “Jail? Possible,” Kusuma wrote. He 
later deleted the tweet, but Melissa Ilardo, 
the Cell study’s first author, says she was 
so rattled that she canceled a July trip to 
Indonesia during which she planned to 
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inform the Bajau about her study. “I did 
everything I could to conduct this research 
ethically and properly, and this is breaking 
my heart,” says Ilardo, a Ph.D. student at KU 
at the time of the fieldwork and now at the 
University of Utah in Salt Lake City. 

Sometimes called sea nomads, the Bajau 
have lived off the ocean for centuries; men 
spend much of the day underwater to spear 
fish and harvest sea cucumbers. In 2015, 
llardo took saliva samples from 59 Bajau in- 
dividuals in Central Sulawesi and measured 
their spleen size. The team found that, com- 
pared with controls, the Bajau have bigger 
spleens, which may help prevent hypoxia 
during long dives by releasing extra blood 
cells. The researchers also identified a gene 
variant that may be responsible. 

Willerslev’s group received a permit for 
the study from Indonesia’s Ministry of Re- 
search, Technology and Higher Education, 
commonly known as RISTEK, in Jakarta and 
ethical clearance from the Danish National 
Committee on Health Research Ethics. “We 
were told that RISTEK permit included local 
ethical approval as well, thus there was no 
ethical violation,” Willerslev says. 

Sadjuga, secretary of RISTEK’s Foreign 
Research Permit Coordinating Team, dis- 
putes that account. “We always request ethi- 
cal clearance from at least one Indonesian 
research ethics commission,” Sadjuga says. 
(Like many Indonesians, he uses only one 
name.) Triono Soendoro, who heads the Ethi- 
cal Commission for National Health Research 
and Development at the Indonesian Ministry 
of Health in Jakarta, confirms that the team 
should have had approval from an ethi- 
cal panel in Indonesia; guidelines from the 
Council for International Organizations of 
Medical Sciences also call for local approval. 

The team may also have run afoul of 
regulations when it shipped DNA samples 
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to Copenhagen for analysis. Ilardo says 
she filed a material transfer agreement 
(MTA)—a contract governing the shipment 
of research samples—with her application 
to RISTEK. But for the transfer of human 
DNA, she should have sought approval from 
the National Institute of Health Research 
and Development in Jakarta, says Siswanto, 
who chairs that institute. “If this was a 
requirement, I would have expected that 
RISTEK would have told me if my MTA was 
invalid when I submitted it,” Ilardo says. 

Some Indonesian scientists, meanwhile, 
are miffed that the only Indonesian name on 
the paper is that of Suhartini Salingkat, an 
education researcher at Tompotika Luwuk 
Banggai University, a small private institu- 
tion in Central Sulawesi; according to the 
paper, she “provided logistical support.” For- 
eign teams “should involve Indonesian scien- 
tists in all stages of research,” says Mohamad 
Belaffif, an Indonesian bioinformatician at 
the HudsonAlpha Institute for Biotechnol- 
ogy in Huntsville, Alabama. 

Ilardo says she did try to collaborate 
with scientists at the Eijkman Institute af- 
ter RISTEK requested she do so. An email 
exchange between Ilardo and Sudoyo, 
provided to Science by Willerslev, shows 
Sudoyo didn’t respond to several requests 
for a meeting in October 2015, before field- 
work began, and later effectively declined 
a partnership. “As far as I understand, you 
have your own partner already in the Ba- 
jau project, therefore we are not needed,” 
she wrote. (Sudoyo declined to answer 
Science’s questions on this matter.) Given 
Tlardo’s overtures to the Eijkman Institute, 
“IT would love to understand what went 
wrong and why they suddenly are so an- 
gry,” says Rasmus Nielsen of the University 
of California, Berkeley, another senior au- 
thor on the paper. 
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Acontroversial study showed that the Bajau are good 
divers thanks in part to an enlarged spleen. 


llardo says she shared her genetic exper- 
tise with Tompotika students in an informal 
seminar, and made the partnership worth- 
while for Salingkat by helping her with a 
research paper. In Ilardo’s application to 
RISTEK, she also promised to organize a 
meeting with the Bajau people to tell them 
about the results of the study. But even if 
she hadn’t abandoned that plan follow- 
ing Kusuma’s tweet, some argue it would 
have been too late. “In general, the return 
[of research results] should coincide with 
or slightly precede publication so that the 
participants are not the last to know,’ says 
Conrad Fernandez, a bioethicist at Dalhou- 
sie University in Halifax, Canada. 

Berry Juliandi, a biologist at Bogor Agri- 
cultural University in Indonesia, says the 
country’s “tangled” system of permits can 
be hard to navigate for foreign researchers. 
“The root of this problem is the weak man- 
agement of foreign research permits in Indo- 
nesia,” he says. “How could RISTEK approve 
Ilardo’s permit proposal [when] she doesn’t 
have a valid MTA and ethical clearance from 
an Indonesian institution?” Working with a 
bigger, more experienced local institution 
than Tompotika might have helped the re- 
searchers avoid pitfalls, he says. 

The case comes at a sensitive time, when 
Indonesian and foreign scientists are de- 
bating rules, proposed in 2017, that would 
strengthen MTA regulations, compel for- 
eign researchers to include Indonesian 
colleagues as “equal partners” on projects, 
and include them as authors on every peer- 
reviewed paper about the work. Outside 
researchers would also have to submit raw 
data to the country’s research ministry; 
some violations would carry prison sen- 
tences. Some scientists, both in Indonesia 
and abroad, say the law is unworkable and 
could stifle scientific progress. At the same 
time, RISTEK says it wants to promote 
research collaborations, and on 5 July, it 
launched an online system that makes the 
paperwork easier and less time-consuming 
for foreign researchers. 

Neither RISTEK nor the Ministry of 
Health has taken action against the re- 
searchers over the Bajau study. A spokes- 
person for Cell says the journal is satisfied 
by the researchers’ explanation. “The au- 
thors sent us documentation indicating that 
they received consent from the Indonesian 
government to conduct this research,” he 
says. “We have no evidence that further in- 
vestigation of this matter is warranted.” & 


Dyna Rochmyaningsth is a science journal- 
ist based in Deli Serdang, Indonesia. 
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By Daniel Clery 


ar beneath the deeply frozen ice cap 

at Mars’s south pole lies a lake of liq- 

uid water—the first to be found on 

the Red Planet. Detected from orbit 

using ice-penetrating radar, the lake 

is probably frigid and full of salts—an 
unlikely habitat for life. But the discovery, 
reported online in Science this week, is sure 
to intensify the hunt for other buried layers 
of water that might be more hospitable. “It’s 
a very exciting result: the first indication of 
a briny aquifer on Mars,” says geophysicist 
David Stillman of Southwest Research In- 
stitute in Boulder, Colorado, who was not a 
part of the study. 

The lake resembles one of the inter- 
connected pools that sit under several kilo- 
meters of ice in Greenland and Antarctica, 
says Martin Siegert, a geophysicist at Im- 
perial College London, who heads a con- 
sortium trying to drill into Lake Ellsworth 
under West Antarctica. But the processes 
that gave rise to a deep lake on Mars are 
likely to be different. “It will open up a very 
interesting area of science on Mars,” he says. 

Water is thought to have flowed across 
the surface of Mars billions of years ago, 
when its atmosphere was thicker and 
warmer, cutting gullies and channels that 
are still visible. But today, low atmospheric 
pressures mean that any surface water 
would boil away. Water survives frozen in 
polar ice caps and in subsurface ice depos- 
its. Some deposits have been mapped by 
the Mars Advanced Radar for Subsurface 
and Ionospheric Sounding (MARSIS), an in- 
strument on the European Space Agency’s 
Mars Express orbiter, which launched in 
2003. MARSIS beams down pulses of radio 
waves and listens for reflections. Some of 
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the waves bounce off the surface, but oth- 
ers penetrate up to 3 kilometers and can be 
reflected by sharp transitions in the buried 
layers, such as going from ice to rock. 

Several years into the mission, MARSIS 
scientists began to see small, bright echoes 
under the south polar ice cap—so bright 
that the reflection could indicate not just 
rock underlying the ice, but liquid water. 
The researchers doubted the signal was 
real, however, because it appeared in some 
orbital passes but not others. 

Later the team realized that the space- 
craft’s computer was averaging across pixels 
to reduce the size of large data streams— 
and in the process, smoothing away the 
bright anomalies. “We were not seeing the 
thing that was right under our noses,” says 
Roberto Orosei, a principal investigator (PI) 
for MARSIS at the Italian National Institute 
for Astrophysics in Bologna. 

To bypass this problem, the team com- 
mandeered a memory chip on Mars Express 
to store raw data during short passes over 
intriguing areas. Between 2012 and 2015, 
the spacecraft confirmed the existence of 
the bright reflections during 29 passes over 
the south polar region. The brightest patch, 
offset 9° from the pole, lies 1.5 kilometers 
under the ice and spans 20 kilometers, 
Orosei and his colleagues report. 

The radar brightness alone isn’t enough 
to prove that liquid water is responsible. 
Another clue comes from the permittiv- 
ity of the reflecting material: its ability to 
store energy in an electric field. Water has 
a higher permittivity than rock and ice. Cal- 
culating permittivity requires knowing the 
signal power reflected by the bright patch, 
something the researchers could only esti- 
mate. But they find the permittivity of the 
patch to be higher than anywhere else on 
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Liquid water lies 
unseen under ice 
(white) at Mars’s 
south pole in an image 
from Mars Express. 


Mars—and comparable to the subglacial 
lakes on Earth. Although the team cannot 
measure the thickness of the water layer, 
Orosei says it is much more than a thin film. 

Not everyone on the MARSIS team is 
convinced. “I would say the interpretation 
is plausible, but it’s not quite a slam dunk 
yet,” says Jeffrey Plaut, the other MARSIS 
PI at NASA’s Jet Propulsion Laboratory in 
Pasadena, California, who is not an author 
on the study. 

After all, it isn’t easy to explain the pres- 
ence of water at Mars’s south pole. In Earth’s 
polar regions, the pressure of the overlying 
ice lowers its melting point, and geothermal 
heat warms it from below to create the sub- 
glacial lakes. But there’s little heat flowing 
from the geologically dead interior of Mars, 
and under the planet’s weak gravity, the 
weight of 1.5 kilometers of ice does not lower 
the melting point by much. Orosei suspects 
that salts, especially the perchlorates that 
have been found in the planet’s soils, could 
be lowering the ice’s melting point. “They are 
the prime suspects,” he says. 

High levels of salt and temperatures 
dozens of degrees below zero do not bode 
well for any microbes trying to live there, 
Stillman says. “If martian life is like Earth 
life, this is too cold and too salty.” But he 
says researchers will want to look for other 
lakes under the ice and find out whether 
they are connected—and whether they 
point to an even deeper water table. 

Lakes might even turn up at lower, 
warmer latitudes—a location more suitable 
for a martian microbe, says Valérie Ciarletti 
of the University of Paris-Saclay, who is de- 
veloping a radar instrument for Europe’s 
ExoMars rover, due to launch in 2020. “The 
big, big finding would be water at depth 
outside the polar cap.” 
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ELECTRICAL ENGINEERING 


Chipmakers look past Moore's law, and silicon 


Novel computer chip materials explored in $1.5 billion U.S. military research program 


By Robert F. Service 


ilicon computer chips have been on 

a roll for half a century, getting ever 

more powerful. But the pace of in- 

novation is slowing. This week, the 

U.S. military’s Defense Advanced Re- 

search Projects Agency (DARPA) an- 
nounced dozens of new grants totaling $75 
million in a program for academic and in- 
dustry scientists. The program, called the 
Electronics Resurgence Initiative, aims to 
spur new chip designs and materials, such 
as carbon nanotubes. Over the 
next few years, the program 
will grow to $300 million per 
year, up to a total of $1.5 billion 
over 5 years. 

“It’s a critical time to do 
this,” says Erica Fuchs, a com- 
puter science policy expert at 
Carnegie Mellon University in 
Pittsburgh, Pennsylvania. “It’s a 
good first step.” 

In 1965, Intel co-founder 
Gordon Moore made the ob- 
servation that would become 
his eponymous “law”: The 
number of transistors on chips 
was doubling every 2 years, a 
time frame later cut to every 
18 months. But the gains from 
miniaturizing the chips are 
dwindling. Today, chip speeds 
are stuck in place, and each 
new generation of chips brings 
only a 30% improvement in 
energy efficiency, says Max 
Shulaker, an electrical engineer 
at the Massachusetts Institute of Technol- 
ogy in Cambridge. Fabricators are ap- 
proaching physical limits of silicon, says 
Gregory Wright, a wireless communica- 
tions expert at Nokia Bell Labs in Holm- 
del, New Jersey. Electrons are confined to 
patches of silicon just 100 atoms wide, he 
says, forcing complex designs that prevent 
electrons from leaking out and causing er- 
rors. “We're running out of room.” 

Moreover, only a handful of companies 
can afford the multibillion-dollar fabrica- 
tion plants that make the chips, stifling in- 
novation in a field once dominated by small 
startups, says Valeria Bertacco, a computer 
scientist at the University of Michigan in 
Ann Arbor. And some big companies are 
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going down separate paths, designing spe- 
cialized chips for specific tasks, Fuchs says. 
That has reduced the incentive for them 
to pay for shared, precompetitive basic re- 
search. The number of companies involved 
in the Semiconductor Research Corpora- 
tion in Durham, North Carolina, which 
backs such work, dropped from 80 in 1996 
to less than half that in 2013, according to 
a study by Fuchs and her colleagues. 
DARPA is now trying to fill the gap, with 
grants to researchers such as Shulaker. 
He is fashioning 3D computer chips with 


Awafer contains hundreds of computer chips made from carbon nanotubes, which 
switch on and off faster and more efficiently than transistors made from silicon. 


transistors made of carbon nanotubes, 
which switch on and off faster and more 
efficiently than silicon transistors. Com- 
panies today already make 3D chips with 
silicon as a way to pack logic and memory 
functions closer together to speed up pro- 
cessing. But the chips are slowed down 
by bulky and sparse wiring that carries 
information between the chip layers. And 
because 2D silicon chip layers must be fab- 
ricated separately at more than 1000°C, 
there is no way to build up 3D chips in an 
integrated fabrication plan without melt- 
ing the lower layers. 

Carbon nanotube transistors, which can 
be made at nearly room temperature, offer 
a better path to dense, integrated 3D chips, 
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Shulaker says. Even though his team’s 3D 
chips will have features 10 times larger 
than state-of-the-art silicon devices, their 
speed and energy efficiency is expected to 
be 50 times better—a potential boon for 
power-hungry data centers. 

The DARPA program is also supporting 
research into flexible chip architectures. 
Daniel Bliss, a wireless communications 
expert at Arizona State University in 
Tempe, and his colleagues want to improve 
wireless communications with chips that 
can be reconfigured on the fly to carry out 
specialized tasks. Bliss is work- 
ing on radio chips that mix and 
filter signals using software 
rather than hardware—an ad- 
vance that would allow a larger 
number of devices to transmit 
and receive signals without in- 
terference. This could improve 
mobile and satellite commu- 
nications, as well as enable a 
rapid growth in the “internet 
of things,’ where myriad de- 
vices communicate with one 
another, he says. 

Another DARPA grant, for 
researchers at Stanford Uni- 
versity in Palo Alto, California, 
will go to improving computer 
tools used in chipmaking. These 
tools verify novel chip designs 
with a form of artificial intel- 
ligence called machine learn- 
ing. They would help automate 
the largely manual process 
of detecting design bugs in 

chips made up of billions of tran- 
sistors, and could speed up the ability of 
companies to design, test, and fabricate 
new chip architectures. 

If even a fraction of the new projects 
succeed, the DARPA project “will com- 
pletely revolutionize how we design elec- 
tronics,” says Subhasish Mitra, a Stanford 
electrical and computer engineer, and a 
researcher on the 3D carbon nanotube and 
circuit validation projects. He says it will 
also encourage engineers to look beyond 
silicon, which has dominated research for 
decades. “When I was a student, life was 
boring,” Mitra says. “It was clear that sili- 
con would move forward along a known 
path. Now, it’s absolutely clear that’s not 
what the future is.” 
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THE TRAILBLAZER 


Joanna Harper is racing to reveal how a gender 
transition alters an athlete’s physiology and performance 


By Katherine Kornei, in Portland, Oregon, and Phoenix 


PHOTO: BETH NAKAMURA 
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Joanna Harper’s 
personal journey 
and love of running 
have shaped her 
research interests. 


oanna Harper swallowed a few pills 
in late August 2004, a day after run- 
ning in the Hood to Coast relay race 
between Oregon’s highest moun- 
tain and the Pacific Ocean. They 
delivered a small dose of estrogen 
and a testosterone blocker and set 
in motion changes that Harper, who 
was designated male at birth and 
raised as a boy, had imagined since child- 
hood. Harper’s timing was deliberate—the 
47-year-old nationally ranked runner wanted 
one more race before disrupting her hor- 
mones because she knew she'd never run as 
fast again. 

The testosterone that courses through a 
man’s body after puberty triggers and main- 
tains a slew of physical changes: Men, whose 
levels of the hormone are usually some 10 to 
15 times those of women, typically have larger 
muscles, denser bones, and higher frac- 
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tions of lean body mass than women. That 
hormone-fueled transformation confers cer- 
tain athletic advantages, and men on average 
run faster, lift more weight, and throw harder 
and farther than women. Sporting events are 
therefore usually split into male and female 
categories to ensure fair competition. But 
this division of the sexes, which has existed 
for as long as women have competed as ath- 
letes, forces an important question: Who, at 
least from an athletic standpoint, is female? 

Many people believe transgender women 
such as Harper have athletic advantages over 
non-transgender women—sometimes called 
cisgender women—because of their previous 
exposure to male levels of testosterone. But 
Harper, a medical physicist at a large medi- 
cal center in Portland, Oregon, has been chal- 
lenging that assumption with data. In 2015, 
she published the first study of transgender 
athletes’ performances, finding that trans- 
gender women who received treatment to 
lower their testosterone levels did no better 
in a variety of races against female peers than 
they had previously done against male run- 
ners. Although Harper’s study included only 
a few transgender women, Eric Vilain of The 
George Washington University in Washing- 
ton, D.C., a geneticist who specializes in gen- 
der-based biology, calls it “groundbreaking.” 

That work helped make Harper an unpaid 
adviser for sporting bodies, such as the Inter- 
national Olympic Committee (IOC), that are 
wrestling with transgender issues and other 
matters of gender. Although Harper has just 
a master’s degree, she is helping spearhead 
several studies documenting how the physio- 
logy and performance of transgender athletes 
change as they make their transition. 

Harper may not have the traditional pedi- 
gree of a scientist, Vilain says, but “her ap- 
proach is highly respected.” Harper has made 
“very important” contributions to under- 
standing gender and sports performance, an- 
ticipating the debate that now swirls around 
transgender and “intersex” athletes, adds 
Stéphane Bermon, a sports physician and ex- 
ercise physiologist at the Monaco Institute of 
Sports Medicine and Surgery. 


IN THE 1960s, elite female athletes had to 
prove their sex by showing sporting officials 
their genitalia. Those unpopular “nude pa- 
rades” were soon replaced by chromosomal 
tests, which had their own problems. Biology 
does not always cleanly divide human beings 
into two sexes. Some people, often described 
as intersex, have unusual sex chromosome 
arrangements or physical attributes such as 
ambiguous genitalia. Others have an innate 
sense of gender differing from the sex they 
were assigned at birth; they often choose to 
hormonally and/or surgically transition to 
the gender they identify as. 
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Harper, who grew up in Parry Sound, a 
small town about 250 kilometers north of 
Toronto, Canada, is among the latter. The 
oldest child of a high school teacher and a 
nurse, Harper wore her mother’s and sister’s 
clothing in private from a young age. Leav- 
ing elementary school one day, Harper asked 
a male friend an earnest question—had he 
ever thought, as Harper had, about being 
a girl? The boy recoiled. “It was clear from 
the way that my friend looked at me that I 
was never going to ask anybody that again,” 
Harper says. 

In high school, Harper threw herself into 
sports. She was best known as a star basket- 
ball player, averaging 18 points per game as 
a guard on the boys’ team her senior year. 
But Harper was short for the men’s college 
game. Instead, she ran track and cross- 
country as an undergraduate at what was 
then the University of Western Ontario in 
London, Canada, where she majored in phys- 
ics. In 1982, Harper earned a master’s degree 
in medical physics, training that prepared 
her to determine radiation treatment dosages 
for cancer patients. The career choice was, in 
part, lifestyle-oriented. “It was clear medical 
physicists were not putting in huge hours, 
and it would allow me time to run,” she says. 

Harper moved to Oregon a few years later 
to be closer to family in the Canadian prov- 
ince of British Columbia and to take a job 
in Portland. But her gender dysphoria al- 
ways lurked. By 2004, “I was very close to a 
mental breakdown,” she says. She started to 
see a therapist, who asked her to list what 
she’d lose if she transitioned genders. Near 
the top of that list was competitive sports— 
transgender athletes had no official opportu- 
nities to compete at the time. 

Barely 3 months later, in May 2004, IOC 
announced a landmark ruling: Transgender 
athletes could participate in the Olympics. 
Strict stipulations were in effect for trans- 
gender women competing in women’s 
events—they needed sex reassignment sur- 
gery, legal recognition of their gender by 
“the appropriate official authorities,’ and at 
least 2 years of verifiable hormone replace- 
ment therapy—but it was “a light at the end 
of the tunnel” for Harper. She wasn’t an 
Olympic-level athlete, but she believed that 
change would trickle down to other sports 
organizations and she would be able to 
compete again. 

Harper started hormone replacement 
therapy in August 2004. Spironolactone, a 
testosterone blocker and diuretic, flushed 
the testosterone her body was producing, 
and estradiol, a form of estrogen, began to 
make her form more typically feminine. In 
just a few weeks, Harper noticed changes 
such as breast tenderness and a decrease in 
body hair. The transition was a “very, very, 
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very difficult time,” Harper says. She lost most 
of her male friends, and her mother—her 
only living immediate family member—was 
not supportive. “When it became clear that I 
was going to go through with my transition, 
my mother said she never wanted to see me 
again,’ Harper recalls. Barbara Harper, who 
died in 2013, eventually relented, in a way. 
“When I visited, she didn’t tell anyone who 
I was.” “This is Joanna,’ was all her mother 
would say. 

Today, the 61-year-old with collarbone- 
length red hair describes herself as a scien- 
tist, an athlete, and a transgender person—in 
that order. But being transgender can some- 
times overshadow everything else. “We joke 
in the transgender world that if Hitler had 
been transgender, he would be described as 
‘that transgender dictator’? That’s the cat- 
egory that everybody wants to put you in.” 


IN 1976, RENEE RICHARDS entered a New 
Jersey professional tennis tournament. 
Richards, who had enrolled at Yale Uni- 
versity as Richard Raskind and captained 
its men’s tennis team before undergoing 
sex reassignment surgery, was met with 
open hostility—more than 20 female play- 
ers boycotted the competition in protest 
of her perceived advantages. Indeed, some 
physical attributes such as hand size and 
height—like Richards’s tall frame—remain 
largely unchanged after hormone therapy, 
Harper says. 

Many people expect other physical ad- 
vantages to linger, too. Men generally have 
blood with higher oxygen-carrying capacity 
because testosterone stimulates bone mar- 
row to produce more red blood cells, says 
Siddhartha Angadi, a cardiovascular physio- 
logist at Arizona State University in Phoe- 
nix. Male bodies are also generally leaner, 
and carry less body fat—“an obvious benefit 
when it comes to athletic performance,” 
Angadi says. 

Some people therefore insist that trans- 
gender women and many intersex athletes 
competing in women’s events will always 
have an unfair edge. (Little controversy ex- 
ists over transgender men in sports, as many 
expect them to be at a disadvantage.) Others 
believe athletes should be able to compete 
in their self-identified gender without regu- 
lations. Harper wants to address the ques- 
tion with data. “You have to go to science.” 

Before her own transition in 2004, 
Harper expected that her 10,000-meter race 
time might increase by “a minute or two” 
as her testosterone level dropped and she 
slowed. But in less than a year, Harper was 
running a full 5 minutes slower than her 
personal best. “It just blew me away, and it 
very much piqued my interest as a scientist.” 

In 2005, Harper realized her experience 
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wasn’t unique after reading an article in 
Runner’s World about another transgender 
female runner who had also become signifi- 
cantly slower. But when Harper searched 
for studies about the physiology of transi- 
tioning, she found none. So on nights and 
weekends, she began to moonlight on a re- 
search project. 

Harper searched for transgender female 
distance runners willing to share race times 
from before and after their transitions. The 
transgender population, even now, tends to 
be “small and secretive,’ Harper says, and it 
took 7 years of contacting athletes through 


Different, but same 

A study of transgender women found their race times 
slowed after transitioning, but their age grades, which 
compare people to the best runners of the same sex 
and age, hardly changed, suggesting they have no 
advantage over non-transgender women. 


Half-marathon 
race times 


Male Female 


Age grade 
difference 


5K race times Female 


— -3.7 
+2.6 
-3.0 


-3.0 
+2.0 


+0.6 


Yahoo and Facebook groups to collect data 
from eight runners. All the women had under- 
gone hormone therapy to bring their tes- 
tosterone levels in line with typical female 
levels. In Harper’s study, titled simply “Race 
Times for Transgender Athletes” and pub- 
lished in 2015 in the little-known Journal of 
Sporting Cultures and Identities, she showed 
that all but one person ran substantially 
slower after transitioning. 

Harper also calculated each subject’s age 
grade, a common metric in track and field 
and distance running that reflects an ath- 
lete’s performance compared with the fast- 
est known time by someone of the same age 
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and sex. Harper showed that the athletes’ 
age grades before and after hormone ther- 
apy remained nearly the same. That is, the 
women were as competitive with their age- 
and sex-matched peers as they had been 
when competing against men. They weren’t, 
in other words, likely to dominate women’s 
races. “No one had previously looked at ac- 
tual performance of transgender athletes 
pre- and posttransition,” Vilain says. 

Harper has since shown similar results 
for a transgender rower, a cyclist, and a 
sprinter. Together, the findings make a case 
that previous exposure to male levels of 
testosterone does not confer an enduring 
athletic advantage. 

In 2015, IOC invited Harper to attend 
its Consensus Meeting on Sex Reassign- 
ment and Hyperandrogenism held in Lau- 
sanne, Switzerland. After 3 days, the panel 
of scientists and physicians converged on 
revised rules for transgender competitors, 
including at least 1 year of hormone re- 
placement therapy for female competitors, 
rather than the 2 years previously required. 
That change was a nod to Harper’s personal 
transition experience and to research pub- 
lished in 2004 in the European Journal of 
Endocrinology showing that the testoster- 
one levels—and therefore performance— 
of 19 transgender women stabilized after 
12 months of hormone therapy. The revised 
IOC policy also lifted the requirement for 
sex reassignment surgery. That decision 
was a long time coming, Harper says. “What 
your genitals are doesn’t make a difference.” 

Less settled, however, is the debate about 
the appropriate upper limit of women’s 
testosterone levels in elite athletic compe- 
tition. The current IOC policy dictates that 
transgender women must have a testoster- 
one level less than 10 nanomoles per liter, 
roughly the low end of typical male values. 
But because more than 99% of women have 
testosterone levels less than 3 nanomoles 
per liter, some researchers have suggested 
that limit is too high. Harper is among 
them. “If you’re competing in the women’s 
division, you should do so with women’s 
hormone levels,’ she says. “I understand 
just how much difference they make.” 

South African runner Caster Semenya, 
who has always competed in women’s races 
and won Olympic gold in Rio de Janeiro, 
Brazil, in 2016, recently refocused atten- 
tion on the testosterone issue. In 2009, 
the International Association of Athlet- 
ics Federations (IAAF)—the Stockholm- 
based, world-governing body for track and 
field—controversially required her to take a 
sex-verification test after she breezed past 
competitors in the 800-meter race at the 
IAAF World Championships. The results, 
leaked during the competition, allegedly 
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revealed that Semenya was intersex and 
had three times the testosterone of a typi- 
cal woman. Neither she nor IAAF has ever 
confirmed that publicly, however. 

In April, IAAF issued a policy that many 
groups—including Athletics South Africa, 
the country’s athletics federation—view 
as targeting Semenya. It applies to women 
competing in certain track and field events, 
including those that Semenya excels at, who 
have specific intersex conditions in which 
their bodies produce and are sensitive to 
higher levels of testosterone. Drawing on 
performance and hormone data from an 
IAAF-sponsored study of athletes compet- 
ing at its recent World Championships, 
which was published last year in the British 
Journal of Sports Medicine, the policy re- 
quires testosterone levels below 
5 nanomoles per liter, half the 
previous IAAF threshold. The 
goal is “leveling the playing field 
to ensure fair and meaningful 
competition,” IAAF President 
Sebastian Coe said in a statement. 

In June, Semenya called that 
policy discriminatory and said 
she would challenge it at the 
Court of Arbitration for Sport. 
“T just want to run naturally, 
the way I was born,” the 27-year- 
old told media. “I am a woman 
and I am fast.” And earlier this 
month, three researchers who 
analyzed a subset of data from 
the 2017 study claimed to have 
found “significant problems 
and anomalies” and called for 
its retraction. IAAF updated the 
paper, but told The New York 
Times, which first reported the 
flap, that “the conclusions re- 
main the same.” 

More controversy may be on 
the way: Within the next few 
months, IAAF is expected to is- 
sue updated testosterone-based regulations 
for transgender women as well. IOC also 
plans to announce new testosterone limits 
for athletes in women’s events, which will 
be in effect for the 2020 Summer Olympics 
in Tokyo. 


TO GET A CLEARER PICTURE of how changing 
hormone levels affect an athlete’s body, 
Harper and others want to collect data from 
people during their transitions. “It’s impor- 
tant to know more about lean body mass; 
hemoglobin concentration; and psycho- 
logical, endocrine, and metabolic changes 
during transition,” says Bermon, who is also 
a scientific consultant for IAAF. 

Harper recruited the athletes for the first 
such studies. One focuses on a 28-year-old 


SCIENCE sciencemag.org 


distance runner named Lauren. (The athletes 
undergoing testing requested that only their 
first names be used in this story.) Roughly 
once a month, Lauren makes the 45-minute 
drive from her home in Queen Creek, Ari- 
zona, to downtown Phoenix to undergo a bat- 
tery of tests in Angadi’s lab. 

He and his team have been measur- 
ing, among other things, the elasticity of 
Lauren’s arteries, her bone density and dis- 
tribution of fat, and how the myocardial 
fibers of her heart twist and untwist with 
each beat. After she started estrogen in- 
jections last fall, Lauren’s blood pressure 
dropped by about 10 points and her body fat 
increased, Angadi says. Those changes are 
small, he cautions, and more measurements 
are needed. Another year of data may reveal 


Caster Semenya’s impressive victories in women’s races have ignited discussions 
on testosterone limits in athletes. 


a decline in bone density. “Bone is a really 
slow-turnover organ,” he says. 

During her visit in April, Lauren pulled 
on a bright blue mask that covered her 
nose and mouth, and Andrew D’Lugos, 
an exercise physiologist, had her start to 
run on a treadmill. “Way to go, Lauren!” 
“Enjoy the flat!” and “Looking good!” he 
and other scientists called out. D’Lugos 
cranked the treadmill up to 13, then 14 kilo- 
meters per hour. A large screen displayed 
Lauren’s heart rate and oxygen consumption, 
new points popping up every 15 seconds. 
D’Lugos gradually increased the treadmill’s 
incline, effectively forcing Lauren to run up 
an ever-steeper hill at her marathon pace. 
When at last she could go no farther, she 
pushed down on the treadmill’s side railings, 
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lifting her thin frame above the spinning 
black belt, her head bent in exhaustion. 

Within 6 weeks after she started estrogen 
injections, Lauren’s peak oxygen consump- 
tion rate—a measure of fitness—fell by 17%, 
the researchers reported at the American 
College of Sports Medicine’s meeting in 
Minneapolis, Minnesota, in June. “She lost 
a fair bit of performance really quickly,’ 
Angadi says. But Lauren’s fitness probably 
hasn’t stabilized yet, he notes, because she 
only recently started to take a testosterone 
blocker. Angadi’s team will test Lauren into 
2019, when she runs the Boston Marathon. 
Charissa, a triathlete living in Colorado who 
is taking part in a similar study, lost roughly 
15% of her aerobic capacity in 9 months 
since beginning hormone therapy, Harper 
reported in March at a British 
Association of Sport and Exer- 
cise Medicine conference held in 
Doncaster, U.K. 

There, Harper announced that 
she and Yannis Pitsiladis—an 
exercise physiologist at the Uni- 
versity of Brighton in the United 
Kingdom best known for his so- 
far futile efforts to train a man 
to run a marathon in less than 
2 hours—plan to monitor roughly 
20 men and 20 women as they 
transition. The largest study of its 
kind, it will recruit subjects from 
a London-based gender clinic and 
enlist the expertise of endocri- 
nologists, muscle physiologists, 
and mental health professionals, 
among others. The undertaking 
will be enormous, Harper ad- 
mits. “We’re going to need help,” 
she told the Doncaster audience. 
Pending research funding from 
the U.K. government, the study 
will begin in 2019. 

Harper still has her dayjob, and 
she’s writing a book about gender 
variance in sports. Fortunately, a refuge from 
all the demands on her time—and the con- 
troversies that come with her research—is 
just a block and a half from her front door. 
Several times a week, Harper turns left from 
her house toward Mount Tabor Park, one of 
Portland’s largest. There, she runs 95 kilo- 
meters a week, often with friends, on trails 
that weave among 100-year-old cedar, spruce, 
and redwood trees. Just like the paths that 
sometimes open up to reveal fleeting views 
of Mount Hood in the distance, Harper’s life 
has had its share of twists and turns. But she 
doesn’t regret the decision she made 14 years 
ago. “I became a much happier person.” & 


Katherine Kornei is a science journalist 
in Portland, Oregon. 
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Reducing uncertainties in climate models 


Implementing accurate calculations of radiative forcing can improve climate projections 


By Brian J. Soden’, William D. Collins”, 
Daniel R. Feldman? 


adiative forcing is a fundamental 
quantity for understanding both an- 
thropogenic and natural changes in 
climate. It measures the extent to 
which human activities [such as the 
emission of carbon dioxide (CO,), see 
the image] and natural events (such as volca- 
nic eruptions) perturb the flow of energy into 
and out of the climate system. This perturba- 
tion initiates all other changes of the climate 
in response to external forcings. Inconsisten- 
cies in the calculation of radiative forcing by 
CO, introduce uncertainties in model projec- 
tions of climate change, a problem that has 
persisted for more than two decades. The 
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explicit calculation of radiative forcing and 
a careful vetting of radiative transfer param- 
eterizations provide a straightforward means 
to substantially reduce these uncertainties 
and improve the projections. 

CO, is the main forcing agent in both 
20th- and 2ist-century emission scenarios 
(1). Twenty-five years ago, Cess et al. pro- 
vided the first comprehensive assessment 
of the calculation of radiative forcing by CO, 
in global climate models (GCMs) (2). They 
found that when CO, was doubled, the radia- 
tive forcing differed substantially among 15 
different GCMs, ranging from ~3.3 to 4.7 W/ 
m/? (see the graph; see the supplementary ma- 
terials for further details). This spread mainly 
arose from intermodel differences in the pa- 
rameterization of infrared absorption by CO,,. 
Other sources of differences, such as the pa- 
rameterization of overlapping absorption by 
water vapor or differences in the cloud distri- 
butions, were shown to be small. 

Thirteen years later, Collins et al. con- 
ducted a more extensive intercomparison of 
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radiative forcing, using a newer generation 
of more than 20 different GCMs (3). They 
found a similar range in radiative forcing at 
the top of the atmosphere for a doubling of 
CO, (see the graph), which again was largely 
due to spread in the infrared component of 
CO, absorption. The authors also compared 
the radiative forcing computed using line- 
by-line (LBL) calculations; the latter solve 
the equation of radiative transfer for each 
absorption line individually, rather than pa- 
rameterizing their absorption over spectrally 
integrated bands. The forcing calculations 
between several different LBL models were 
in much better agreement (see the graph). 
The LBL calculations have also been exten- 
sively validated by using both laboratory 
and field measurements (4), and the spectro- 
scopic foundation for this radiative forcing is 
quite robust (5). The agreement among LBL 
models forms the basis for the narrow un- 
certainty range for CO, forcing noted in the 
Intergovernmental Panel on Climate Change 
(IPCC) reports (1). However, LBL calculations 
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As noted by Cess et al., the impact of this 
inconsistency in the calculations of radiative 
forcing on estimates of climate sensitivity “is 
nearly half of the often quoted range of uncer- 
tainty of 1.5° to 4.5°C.” Thus, even if we could 
make all other aspects of the models per- 
fect, the spread in projections of CO,- 
induced climate change would only be 
reduced by 50% because of the remain- 


oughly vetted against LBL calculations would 
significantly reduce the spread in model pro- 
jections. It would also reduce discrepancies in 
the parameterization of other key absorbers, 
such as water vapor, that also affect model 
calculations of climate sensitivity. Last, it 


Reducing the uncertainty 


ing differences in radiative forcing. 
The contributions of erroneous CO, 
forcing to the persistent spread in 


Radiative forcing uncertainty in GCMs has remained 
high over the past 25 years. LBL calculations show that 
this uncertainty can be substantially reduced. 


GRAPHIC: ADAPTED BY N. CARY/SCIENCE 


Asnapshot from a model simulation 
of CO, traveling through Earth’s 
atmosphere over the course of a year. 


are computationally expensive, and param- 
eterized models of radiative transfer must be 
used in GCMs. Unfortunately, substantial dif- 
ferences still exist in these parameterizations. 
Chung and Soden found that the spread in 
CO, forcing from the most recent generation 
of GCMs remains largely unchanged com- 
pared with that documented in previous gen- 
erations (see the graph) (6). 

The precise measure of radiative forcing 
differs slightly between these three studies 
(7). As a result, their absolute values of ra- 
diative forcing are not directly comparable. 
However, the relative spread in radiative forc- 
ing between models is meaningful and has 
shown little change compared with the true 
uncertainty in radiative transfer, as repre- 
sented by the spread in the LBL calculations. 

This lack of progress over the past 25 years 
is disconcerting. The spread in model calcu- 
lations of CO, forcing does not represent an 
uncertainty in radiative transfer theory, but 
rather the failure to implement that theory 
consistently in radiative transfer parameter- 
izations. This introduces unnecessary noise 
into the model experiments that is difficult to 
remove. Although the users of these models 
are largely unaware of this ongoing problem, 
the unsatisfactory implementation of CO, 
forcing propagates needlessly onto efforts to 
reduce uncertainty in projections of future 
climate change. 
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climate projections undermines the 
utility of these models to answer fun- 
damental questions of central societal 
importance. These errors add unneces- 
sary confusion to the development of 
scientifically rigorous targets for atmo- 
spheric CO, concentrations—and there- 
fore, emissions reductions—that are 
required to limit global temperature 
change. Constraining global warming 
to less than 2°C, as set by the Paris Cli- 
mate Agreement, requires a limit to be 
set on the maximum globally averaged 
CO, concentration compatible with 
that constraint. This limit should be 
established by a multimodel ensemble, 
but the corresponding range of allow- 
able CO, concentrations is unnecessar- 
ily large because the ensemble does not 
consistently incorporate known and 
established physics that relate rising CO, con- 
centrations to radiative forcing. 

Although some efforts are under way to 
better document these differences (8), there 
are two immediate solutions that could help. 
First, it is essential that radiative forcing be 
routinely computed and reported for models 
that participate in Coupled Model Intercom- 
parison Projects (CMIP), a series of coordi- 
nated experiments performed in support of 
the IPCC assessments. For each experiment, 
model simulations are performed by using 
matching emission scenarios, with the intent 
of imposing identical forcings. However, ra- 
diative forcing is rarely reported explicitly by 
these models. Requiring models to do so for 
all emission scenarios would help to ensure 
transparency between the radiative forcing 
experienced by the models and the climate re- 
sponse that results. Cess et al. made a similar 
recommendation 25 years ago (2). The adop- 
tion of this recommendation is long overdue. 

Second, the diversity of radiative transfer 
parameterizations used in GCMs should be 
reduced. Maintaining diversity in models is 
valuable for areas where there is substan- 
tial uncertainty in the underlying physics. 
For most aspects of radiative transfer of rel- 
evance to climate change, this is not the case. 
An effort to consolidate the number of radia- 
tive transfer parameterizations used and to 
implement only those that have been thor- 
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would enable those researchers who focus on 
less well-known forcing agents and their radi- 
ative interactions to have a readily available, 
radiometrically accurate understanding of 
the direct radiative influence of the quantities 
they are measuring, and the processes they 
are studying, on Earth’s climate system. 
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NEUROSCIENCE 


Neurotechnology to address big questions 


Profiling of single neurons in tissue allows structure and function linkage in brain circuits 


By Thomas Kn6pfel 


motions, cognition, and conscious- 

ness emerge with the processing of 

neuronal information provided by 

memory readouts and senses. How 

this actually works seemed for a long 

time to be a question too big to ask. 
Advances in neuroscience have always been 
driven by methodological inventions, and 
current efforts to develop neurotechnolo- 
gies are motivated by experimental strate- 
gies, including analytical dissection (that is, 
inverse engineering), large-scale interroga- 
tion, and synthetic reconstruction of the 
mammalian cerebral cortex (which is im- 
portant for higher cognitive 
functions) and connected 
brain structures (J, 2). As 
part of this neurotechnolog- 
ical endeavor, mapping the 
transcriptome of neurons 
with single-cell resolution 
and with known three-di- 
mensional tissue localiza- 
tion has been a long-sought 
enabling technology (3). On 
page 380 of this issue, Wang 
et al. (4) present the tech- 
nology and workflow to ac- 
cess transcriptional states of 
more than 100 genes from 
up to 30,000 cells in a cubic 
millimeter of cortical tissue. 
This advanced methodology 
will facilitate studies that 
improve our understand- 
ing of the neuronal hard- 
ware and, when combined 
with other emerging neuro- 


Mouse visual 
cortex 


Mouse brain 


technologies, will enable big 8»™—< 
questions in neuroscience Dissected —~ 
to be addressed. sample 


Wang et al. integrated 
knowledge of mouse cortex-specific gene ex- 
pression, targeted signal amplification, in situ 
transcriptomics, and hydrogel-tissue chemis- 
try (5) into a workflow for single-cell resolu- 
tion spatial transcriptomics, called STARmap 
(spatially resolved transcript amplicon read- 
out mapping). Targeted signal amplification 
was achieved by generating circular comple- 
mentary DNAs (cDNAs) after recognition of a 
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Tissue collection 


V1 cortex 
dissection 


In situ DNA sequencing 


specific transcript [messenger RNA (mRNA)] 
by a probe from a library of gene-specific 
primers. These cDNAs were then rolling-cir- 
cle-amplified so as to generate a DNA nano- 
ball (amplicon) that contains multiple copies 
of the cDNA templates (6). Importantly, this 
process takes place in structurally conserved 
brain tissue. Formation of the circular cDNAs 
included a barcode as a distinct identifier, 
which was decoded with a newly designed 
error-reduced in situ sequencing technology 
to identify specific mRNAs (5). 

Since the development of techniques that 
involve mRNA harvesting from single cells 
for sequencing—single-cell reverse transcrip- 
tion-polymerase chain reaction (scRT-PCR)— 


Profiling brain tissue 
Wang et al. provide a method to determine the activity of marker genes within a sample of 
brain tissue. This allows identification and mapping, for example, of subtypes of excitatory 
or inhibitory neurons in the cortical layers, corpus callosum, and hippocampus. 


transcriptomics has been instrumental to 
classify the many functionally distinct cell 
types found in the cerebral cortex (7). scRT- 
PCR in combination with electrophysiologi- 
cal recordings has the advantage to directly 
link specific gene expression patterns and 
cellular functional characteristics (includ- 
ing information on how specific neurons 
integrate synaptic information and on their 
synaptic connectivity) but is not practical for 
more than 100 cells (7). Scaling this approach 
is required for a systematic analysis of cellu- 
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lar diversity and brain state-dependent tran- 
scription. STARmap can provide information 
about the transcriptome (up to 10,000 genes) 
of up to 30,000 cells in a cortical volume pre- 
pared from an individual mouse brain. 

Hydrogel-tissue chemistry (5) was needed 
to fix the amplicons in the cubic millimeter 
tissue samples. Classification of cortical cell 
type based on anatomical location, gene ex- 
pression pattern, and functional properties 
has been performed in the past by accumulat- 
ing data across many experiments and many 
specimens (7). This knowledge served the 
authors to identify a set of 160 target genes, 
including 112 putative cell-type marker genes 
and 48 genes that are expressed in response 
to neuronal activity pat- 
terns. It is not known which 
of the selected marker genes 
are most indicative for spe- 
cific cell types and which of 
the activity-regulated genes 
are recruited in which cell 
types and under specific pat- 
terns of neuronal activity. 
Therefore, Wang et al. used 
this set of target genes both to 
benchmark their methodol- 
ogy and workflow but also to 
contribute to efforts aimed at 
detecting and classifying cell 
types and corresponding tis- 
sue-organization principles 
in the neocortex of adult 
mouse brain (see the figure). 

A weakness of STARmap 
is the lack of direct linking 
of transcriptomes with cel- 
lular protein expression and 
function. However, because 
several of the chosen marker 
genes encode proteins—in- 
cluding ion channels, recep- 
tors, and neurotransmitter- 
related enzymes that indicate functional 
properties—an indirect link between tran- 
scriptomics and function is already provided. 
In future implementations, a larger number 
of genes indicating functional properties may 
replace marker genes with low cell type- 
specificity to increase the possibilities for 
functional inference. 

The high data content from a single tissue 
specimen provided by STARmap allows for 
correlation of gene activation patterns within 
an individual’s specific brain state at the time 
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of tissue preparation. As a proof of principle, 
the authors compared the transcriptome 
from two cohorts of mice: The first was ex- 
posed for 1 hour to light after 4 days of hous- 
ing in darkness, whereas the second cohort 
was housed continuously in darkness before 
tissue collection. STARmap analysis con- 
firmed and refined our knowledge about the 
recruitment of activity-dependent genes and 
their enhancer RNAs (which control tran- 
scription of their target genes). In the future, 
a larger range of physiological brain states as 
well as rodent models of brain diseases, and 
even human tissue, may be analyzed this way. 

The STARmap data are a valuable source 
for data mining to identify patterns and 
higher-level structural and functional prin- 
ciples, many of which are likely still hid- 
den behind the complex nature of cortical 
circuits and their function. The first appli- 
cations of STARmap already revealed an in- 
triguing finding: a short-range self-clustering 
organization of inhibitory neuron subtypes 
(3). These clusters may just be relicts of cell 
differentiation and cortical morphogen- 
esis with little functional consequences, but 
more exciting are the possible functional 
implications. Among these is the increased 
opportunity of these inhibitory neuron sub- 
types to form direct electrical connections 
with cell bodies and the proximal processes 
of neighboring cells. These gap junctions are 
involved in the generation of high-frequency 
rhythmic circuit activities (8). 

STARmap is well suited for further inter- 
facing with other recent technologies that 
have been developed to address the big ques- 
tions on brain functions. The technology clos- 
est in reach for combination with STARmap 
is functional in vivo optical imaging with ac- 
tivity indicators (9). This combination could 
be tremendously helpful for directly linking 
gene expression patterns with cell activity 
patterns and brain circuit functions. Another 
exciting possible application of STARmap 
could be the mapping of trans-synaptically 
distributed (70) and activity-dependent ex- 
pressed barcoded optical activity reporters. 
This combination of cutting-edge technolo- 
gies would allow efficient linking of cellular 
activity with connectivity, cell-type classifica- 
tion, and proteomic state. 
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GENE EXPRESSION 


Dynamic condensates 
activate transcription 


Transcriptional components exhibit 
transient phase separation to drive gene activation 


By Aaron J. Plys!? and Robert E. Kingston!” 


very aspect of human function, from 
proper cell differentiation and develop- 
ment to normal cellular maintenance, 
requires properly timed activation of 
the necessary genes. This requires tran- 
scription of genomic DNA into messen- 
ger RNA (mRNA), accomplished by RNA 
polymerase II (RNA Pol IJ), which initiates 
transcription at gene promoters. This highly 
regulated process requires hundreds of pro- 


tories,’ where components are organized and 
ready to act on a gene that goes to the cellular 
location of the factory (1). On pages 378, 379, 
and 412 of this issue, Chong e¢ al. (2), Sabari 
et al. (3), and Cho et al. (4), respectively, argue 
that special protein domains, which interact 
with each other to form fleeting or more per- 
sistent interactions, form biomolecular con- 
densates that concentrate the transcription 
machinery. Some of these condensates might 
even form droplets, generating a liquid phase 
separated from the rest of the nucleus. Phase 


Dynamic transcription machinery clustering during gene activation 
Transcription factors (TFs) and coactivators condense into high-concentration clusters in the nucleus. 
Condensation is mediated by low-complexity disordered regions (LCDRs) in these proteins. These clusters 
can incorporate RNA Pol Il through transient interactions to efficiently activate gene transcription. 


Super- 


Super-enhancer clustering of 
Mediator or BRD4 and transient 
interaction with RNA Pol Il 


Sequence-specific transcription 
factors can cluster at promoters 
and transiently recruit RNA Pol Il. 


teins that must go to the promoter in a coor- 
dinated manner. Although many of these 
proteins are already organized into large and 
stable protein complexes, and so travel as a 
group, the process still requires coordination 
of many individual proteins and preformed 
complexes so that they are all in the same 
place on genomic DNA at the same time. This 
problem has been appreciated for years and 
has led to models such as “transcription fac- 
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coactivator LCDR 
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separation is a phenomenon familiar to any- 
one who has made a salad dressing: The oil 
and vinegar exist as two separated liquids. 
Phase separation in cells creates membrane- 
less organelles that, in this case, provide the 
organization necessary for productive tran- 
scription (5). 

Several distinct types of proteins are 
needed for transcriptional activation. Gene- 
specific transcription factors (TFs) bind to 
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specific regions of the genome. They then 
interact with large complexes needed for the 
transcription process, including a key com- 
plex called Mediator, which, in turn, interacts 
with RNA Pol II to increase transcription 
from a promoter (6). Enhancer sequences, 
which are physically removed from the pro- 
moter in the genome, also increase transcrip- 
tion and can be bound by other specialized 
proteins (7). Regions of proteins involved in 
these processes were analyzed in the three 
studies, with the notable finding that several 
different transcriptional regulatory com- 
ponents each contain protein domains that 
form condensates in cells. These condensates 
increase the effective concentration of com- 
ponents needed for transcriptional activation 
and allow organization of those components 
via numerous cooperative interactions within 
the condensates, thus provid- 
ing an attractive mechanism 
for combining factors in a 


These puncta are formed via interactions 
between domains that are called either low- 
complexity domains (LCDs) or intrinsically 
disordered regions (IDRs). These domains 
have limited types of amino acids and are 
characterized as disordered according to 
their predicted secondary structure (JJ). To 
unify the terminology, we refer to these do- 
mains as low-complexity disordered regions 
(LCDRs). The current hypothesis is that pro- 
teins with these domains form networks, 
based, in part, on hydrophobic interactions, 
that are individually short lived and that al- 
low for dynamic interplay that can create 
liquid-like properties (72). All three studies 
offer support for this hypothesis by showing 
that 1,6-hexanediol, which impairs hydropho- 
bic interactions, can disrupt the structures. 
They also all used fluorescence recovery af- 
ter photobleaching (FRAP) 
to show that molecules move 
in and out of these puncta 


timely fashion to generate “special pr otem rapidly, indicating that the 
transcription. domains ... form components that make up 

The visualization of these ° the puncta are dynamic and 
condensates required the use biomolecular not solid aggregates. Finally, 
of imaging technologies to condensates that Sabari et al. and Cho et al. 
characterize the behavior of show, with live-cell movies, 
individual protein domains, concentrate that the puncta can merge 
which was compared to the the transcription together, just as water drop- 
behavior of entire complexes. 7 Ee lets will form a bigger droplet 
The domains were fused to machinery. when they interact on a glass 


fluorescent proteins to allow 

visualization with lattice light-sheet imaging 
(8). This allowed sufficient spatial and tem- 
poral resolution to see condensates, which 
display as puncta of fluorescence, formed by 
these domains in living cells and to charac- 
terize the dynamics of these condensates (see 
the figure). Domains of the TFs, FET [com- 
posed of FUS, EWS, and TAFI5 (TATA-box 
binding protein-associated factor 15)] and 
SP1 (specificity protein 1) form puncta. Thus, 
these protein domains cluster with each 
other instead of freely diffusing separately 
from each other. Similarly, the enhancer 
binding factors BRD4 (bromodomain-con- 
taining protein 4) and MED1 (mediator of 
RNA Pol II transcription subunit 1) were seen 
in discrete puncta. MED1 is one of more than 
20 proteins that comprise the Mediator com- 
plex (9), and the entire Mediator complex can 
also be seen as puncta. RNA Pol II, itself com- 
posed of 12 subunits, also forms clearly delin- 
eated puncta. There is evidence for multiple 
molecules important for transcription being 
incorporated into the same phase-separated 
condensates (10). Chong et al. and Cho et al. 
found that RNA Pol II colocalized with TFs 
or Mediator puncta in live cells, respectively. 
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surface. Thus, 
have liquid-like characteristics. 
The studies explore the role for specific 
protein sequences in self-association. In 
Chong e¢ al., examination of the LCDRs of 
TFs shows that there is specificity in the 
interactions. Factors can self-associate (for 
example, FET LCDRs), but certain inter- 
actions between separate factors (for ex- 
ample, FET LCDRs and the SP1 LCDR) do 
not occur. This might be due to differences 
in the sequences of the LCDRs. Mutational 
analysis of the EWS LCDR demonstrated 
that 29 tyrosine residues were required for 
LCDR interactions. Similarly, Sabari et al. 
found that the MED1 LCDR sequence was 
dominated by serine residues, which were 
required for self-association and liquid 
droplet formation in vitro. 
The extent to which phase separation is 
a necessary element of transcriptional activa- 
tion is called into question by Chong e¢ al., 
who found that TF LCDR self-assembly is 
transient at physiological concentration, in 
the range of seconds, and thus not consistent 
with phase separation into isolated droplets. 
This raises the question of when phase sepa- 
ration is an important part of the mechanism 
as opposed to a side effect of more transient 
interactions. Transient interactions between 
LCDRs might play a critical role in organizing 
components without a need for a stably phase- 


the puncta 
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separated state. One possibility is that some 
interactions need only be transient, whereas 
others require greater stability. Perhaps the 
“potency” of domains to phase-separate (that 
is, the concentration needed to achieve phase 
separation) varies depending on need. For 
example, in Cho et al., Mediator and RNA Pol 
II both have properties consistent with be- 
ing phase-separated at normal physiological 
concentration. This is presumed to help the 
two complexes interact, but the interaction 
between the two complexes is transient. Ac- 
tivation of transcription by necessity requires 
that some interactions be transient: RNA Pol 
II must initiate transcription and elongate 
the transcript by interacting with different 
complexes (13). The ability of the activa- 
tion components to move onto the next 
round of transcription, and thus switch 
contacts to a new RNA Pol II, is likely to be 
important for genes that are being rapidly 
transcribed. Thus, in transcriptional acti- 
vation, there are sound theoretical reasons 
to have interactions that not only increase 
effective local concentration (as transient 
interactions between LCDRs would) but 
that also allow those components to be in- 
tegrated with, instead of separated from, 
other nuclear components. 

By contrast, in other regulatory settings, 
phase separation might be helpful or even 
necessary for appropriate regulatory func- 
tions of transcription, for example, interac- 
tions involved in stable repression (J4, 15) or, 
perhaps, long-range enhancer interactions. 
The continued development of technologies 
and experimental strategies to determine the 
importance of phase separation will be an ex- 
citing area to follow. Are these condensates 
central to most nuclear functions and thus a 
general regulatory mechanism with multiple 
distinct specificities and temporal charac- 
teristics? How much of nuclear function oc- 
curs in phase-separated domains? Do long 
noncoding RNAs, which are prevalent in the 
nucleus, contribute to the potency of phase 
separation and/or the organization of phase- 
separated domains? When is phase separa- 
tion essential to the processes that generate 
the regulatory organization needed for life? 
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CHEMISTRY 


Hydrocarbon synthesis with vinyl cations 


Catalytic generation of vinyl cations enables synthesis of functionalized hydrocarbons 


By Sean H. Kennedy and 
Douglas A. Klumpp 


ydrocarbons—substances composed of 
carbon and hydrogen—are central to 
many industrial processes, including 
fuel preparation, polymer manufac- 
turing, and the synthesis of feedstock 
chemicals (7). Many biosynthetic path- 
ways also involve hydrocarbon reactions. For 
example, geranyl, farnesyl, or geranylgeranyl 
diphosphates are transformed into thousands 
of different terpenoid natural products (2). 
Hydrocarbons are key components of extra- 
terrestrial atmospheres, nebulae, and other 
regions of space (3). Reactions of simple 
or polycyclic hydrocarbons were probably 
among the prebiological conversions that led 
to more complex molecular structures and 
the emergence of life (4, 5). On page 381 of 
this issue, Popov et al. (6) report a method for 
catalyzing hydrocarbon reactions that have 
been difficult to achieve with other methods. 

Over the past century, great strides have 
been made in understanding hydrocar- 
bon chemistry and using hydrocarbons in 
synthetic reactions. Nevertheless, carrying 
out useful chemical transformations on 
the least-reactive hydrocarbon substrates— 
namely, the saturated alkanes and cycloal- 
kanes—remains challenging. As a result, 
there has been much effort to develop C-H 
bond activation chemistry for hydrocar- 
bons and hydrocarbon groups (7). These 
compounds are relatively inert because of 
their strong C-H bonds and the absence 
of reactive frontier molecular orbitals. To 
develop new methods of synthesis with 
saturated hydrocarbons, chemists must 
devise techniques for generating activated 
reagents that can attack the strong C-C or 
C-H bonds of these substrates. Popov et al. 
accomplish this in their study of vinyl cat- 
ions and their chemistry. 

The authors show that use of a vinyl tri- 
flate, triethylsilane, and a trityl salt initiator 
leads to a catalytic reaction that converts al- 
kanes and cycloalkanes into functionalized 
products. For example, the vinyl triflate 1 
derived from cyclohexanone reacts with cy- 
cloheptane to provide the hydrocarbon 6 in 
88% yield (see the figure). This remarkable 
transformation is accomplished under very 
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mild conditions for a C-H functionaliza- 
tion; it is complete in 2 hours at just 30°C. 
A key part of the methodology is the use of 
carborane, a weakly coordinating anion. As 
an inert counter ion, the carborane helps 
to stabilize highly electrophilic cationic in- 
termediates—the silylium ion 2 and vinyl 
cation 3—and enables the cations to exist as 
persistent species in the solution. This allows 
the authors to use the reactive vinyl cation 
in difficult hydrocarbon synthetic reactions. 
To demonstrate the synthetic utility of this 
method, the authors use it to functionalize 
n-alkanes, cycloalkanes, and arenes and to 
prepare a steroid derivative. 

Like a skilled trapeze artist swinging high 
above a circus floor, the chemistry moves 
beautifully through high-energy intermedi- 
ates (see the figure). The reaction sequence 
begins with an initiation step, in which the 
trityl cation abstracts hydride from triethyl- 
silane. This step produces the first high-en- 
ergy intermediate, the silylium ion 2. Since 


the first silylium ions were generated in the 


1990s, it has become clear that these spe- 
cies are very strong Lewis acids (8). As such, 
the silylium ion coordinates to the triflate 
leaving group, resulting in cleavage of the 
C-O bond. This step provides the second 
high-energy intermediate, the vinyl cation 
3. This intermediate inserts into the C-H 
bond of cycloheptane to provide trivalent 
carbocation intermediates 4 and 5. The fi- 
nal step involves hydride transfer from tri- 
ethylsilane to the carbocation 5, leading to 
the hydrocarbon product 6 and regenerat- 
ing the high-energy silylium ion 2. 

When the authors investigated the key 
C-C bond-forming reaction step in a mo- 
lecular dynamics simulation, they found 
that the C-H functionalization occurs 
through a nonclassical ionic structure 
(see the figure). Simulation of the reac- 
tion of vinyl cation 3 and cyclohexane 
suggests that the system proceeds from 
an ambimodal transition-state structure 
to the transient species 7, with a bridging 
C-H-C structure. This is followed by C-C 


Transforming saturated hydrocarbons 
Use of a silylium catalyst leads to the formation of a reactive vinyl intermediate that facilitates the formation 
of new hydrocarbon products. Et, ethyl: OTf, trifluoromethanesulfonate. 
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powerful Lewis acid 
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bond formation to give the transient spe- 
cies 8, a structure with an unsymmetrical 
C-C-C bridge and nonclassical ion char- 
acter. Elegantly designed mechanistic ex- 
periments with the vinyl triflates provide 
support for the nonclassical structures seen 
in the molecular dynamics simulations. 

Nonclassical carbocation structures are 
often characterized by three-atom two- 
electron bonding patterns. Because these 
types of structures deviate considerably 
from the common valance bond model 
(with two-atom-two-electron bonding), 
their existence and role in organic chem- 
istry has been a matter of considerable 
debate during the past 70 years. Strong 
evidence for nonclassical carbocations 
and related hypercoordinate carbon spe- 
cies has been reported—including spectro- 
scopic, chemical, and x-ray crystallography 
evidence—and these structures are now 
universally recognized as a viable part of 
hydrocarbon chemistry (9, 10). 

The existence of nonclassical carboca- 
tions was once questioned by many in the 


“Like a skilled trapeze artist 
swinging high above a 

circus floor, the chemistry 
moves beautifully through 
high-energy intermediates...” 


chemical community, but it is now clear that 
these types of structures have broad impor- 
tance in the chemistry of hydrocarbons (9). 
Nonclassical carbocation chemistry extends 
all the way from enzyme active sites in 
biosynthetic chemistry (77) to the catalytic 
cracking reactors in the petroleum industry 
(1). Popov et al.’s study demonstrates that 
a new generation of synthetic hydrocar- 
bon reactions is also possible through the 
involvement of vinyl cations and transient 
nonclassical carbocation structures. 
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TUMORIGENESIS 


Inflamed T cells and 
stroma drive gut tumors 


Loss of a tumor suppressor in T cells and stromal cells 
drives gastrointestinal polyp growth 


By Pablo E. Hollstein and Reuben J. Shaw 


nactivating somatic mutations causing 

loss of function in the tumor suppres- 

sor gene STK1J (serine-threonine kinase 

11), which encodes the protein LKB1 

(liver kinase Bl), frequently occur in 

several sporadic cancers, notably lung, 
pancreatic, and female reproductive tu- 
mors. Additionally, inherited heterozygous 
germline mutations in STKII cause Peutz- 
Jeghers syndrome (PJS), a cancer predis- 
position syndrome (J). A hallmark of PJS is 
the growth of numerous benign gastroin- 
testinal (GI) hamartomatous polyps and an 
elevated risk of developing malignancies in 
several organs (2). Studies in mouse models 
of PJS have pointed to LKB1 deficiency in 
the stroma, not epithelium, as the contrib- 
uting factor for the formation of the GI pol- 
yps (3). Stroma consists of the fibroblasts, 
smooth muscle, extracellular matrix, and 
basement membrane that support epithe- 
lial tissue. However, how LKB1 deficiency 
in stromal cells could trigger polyp forma- 
tion was unclear. Two recent studies now 
shed light on the mechanism of PJS polyp 
formation. On page 406 of this issue, Pof- 
fenberger et al. (4) find that loss of a sin- 
gle allele of Stk11 in T cells is sufficient to 
drive the formation of GI polyps. Similarly, 
a study by Ollila et al. (5) demonstrated that 
heterozygous loss of Stk1I in gastric stro- 
mal cells was sufficient to drive GI polyp 
formation. Transcriptional profiling in both 
studies revealed an up-regulation of inflam- 
matory cytokines involved in promoting the 
expansion and overgrowth of both stroma 
and normal GI epithelium. By finding that 
inflammation in the stromal compartment 
due to LKB1 loss in T cells (4) or stromal 
cells (5) drives polyp formation, these stud- 
ies highlight how inflammatory signals can 
profoundly alter the microenvironment and 
fuel tumor formation. 

GI polyps in PJS patients contain a mix- 
ture of cell types, all of which have inher- 
ited a heterozygous mutation in STK1/, and 
are characterized by a smooth muscle core, 
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an expanded stromal compartment, and 
hyperplastic epithelia (2). Poffenberger et 
al. used mouse models in which one allele 
of Stk11 could be conditionally deleted in 
different cellular compartments to under- 
stand which cells contribute to polyp for- 
mation. They observed that heterozygous 
deletion of Stki1 in GI epithelial cells did 
not promote polyp formation. Unexpect- 
edly, they instead found that heterozygous 
loss of Stk11 in the hematopoietic compart- 
ment was sufficient to drive growth of GI 
polyps that were histologically indistin- 
guishable from those in Stk17 heterozygous 
mice, and which also resembled polyps 
from PJS patients. Furthermore, they 
found that whereas heterozygous loss of 
Stk11 in B cells did not contribute to polyp 
formation, heterozygous loss of Stk1I in T 
cells was sufficient to promote intestinal 
polyposis. Complementing these findings, 
Ollila et al. showed that heterozygous loss 
of Stk1I in gastric stromal cells was suffi- 
cient to induce PJS-like polyps. 

How does loss of LKB1 lead to polyp 
formation? LKBI1 activity exerts pleiotro- 
pic effects on cell polarity, growth, and 
metabolism, including responses to cel- 
lular energy stress that are mediated by 
phosphorylating and hence controlling the 
activation of adenosine monophosphate 
(AMP)-dependent protein kinase (AMPK) 
(1). AMPK negatively regulates the activ- 
ity of mammalian target of rapamycin 
complex 1 (mTORC1), a central regulator 
of cell growth and proliferation (7). Conse- 
quently, loss of either LKB1 expression or 
AMPK activation results in up-regulation 
of mTORC1 signaling, which is thought to 
contribute to tumor growth in many con- 
texts, including PJS polyposis, and in spo- 
radic tumors with LKB1 loss (J). Previous 
studies found increased mTORCI signaling 
in the epithelial compartment of PJS polyps 
(6), and remarkably, Poffenberger et al. also 
observed this event when LKB1 was only 
deleted in T cells. Surprisingly, mice with 
T cell-specific ablation of AMPK did not 
develop polyps. Moreover, simultaneous 
deletion of Stk17 and Mtor in T cells still 
resulted in the development of GI polyps. 
Together, these results suggest that AMPK 


sciencemag.org SCIENCE 


8LOz ‘92 Ainr uo /610° Bewaouslds'90ua!0S//:djjy Woody papeojuUMOG 


PENU3 NOZFOTOBWIA FPYNNA "What's News" VK.COM/WSNWS 


GRAPHIC: V. ALTOUNIAN/SCIENCE 


and mTORC1 in T cells are dispensable for 
PJS polyp development in these models, 
and that alternative LKB1-dependent sig- 
naling pathways must be responsible for 
polyp growth. Similarly, Ollila et al. report 
that inactivation of AMPK in the GI stro- 
mal compartment did not induce polyposis. 

What signals from LKBI-deficient T 
cells or gastric stromal cells are sufficient 
to drive polyp growth? Poffenberger et al. 
found that heterozygous loss of Stk1I in T 
cells induced the activation of CD4* and 
CD8* T cells in mesenteric lymph nodes, 
with pronounced infiltration of those T 
cells, along with macrophages and neu- 
trophils, into the polyp microenviron- 
ment. Heterozygous expression of Stk11 in 
mice, or specifically in T cells (4) or stro- 
mal cells (5), resulted in the secretion of 
large amounts of inflammatory cytokines, 
including interleukin-6 (IL-6) and IL-11, 
which have previously been associated 
with gastric tumor development (7). This 
increased cytokine production was accom- 
panied by hyperactivated JAK-STAT (Ja- 
nus kinase-signal transducer 
and activator of transcription) 
signaling in the stromal com- 
partment, which contributes 
to inflammation and cancer 
(8). Furthermore, treatment 
with AZD1480, an inhibitor of 
JAKs, quenched STATS activa- 
tion in the stroma and signifi- 
cantly reduced polyp growth in 
Stk1I-heterozygous mice. Simi- 
larly, Ollila et al. found that 
activated JAK-STAT3 signaling 
in stromal cells drives polyp 
formation, and the JAK inhibi- 
tor ruxolitinib (which is already 
clinically approved for mye- 
loproliferative diseases) has 
therapeutic potential in mouse 
models of PJS. Taken together, a 
model emerges in which LKB1 
deficiency in T cells or gastric 
stromal cells produces a proin- 
flammatory microenvironment 
and subsequently gives rise to 
GI polyps (see the figure). 

If not through reduced 
AMPK function, how does 
LKB1 deficiency lead to the 
production and promotion of 
inflammatory cytokines? In 
addition to activating AMPK, 
LKBI1 also phosphorylates and 
activates a family of 12 addi- 
tional AMPK-related kinases 
(9), the potential roles of which 
in T cell deregulation were not 
explored by Poffenberger et al. 
Notably, Ollila et al. reported 
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that reduced expression of the AMPK fam- 
ily members SIK1 (salt-inducible kinase 1), 
MARK1 (MAP/microtubule affinity-regu- 
lating kinase 1), or MARK4 could induce 
production of IL-11 in fibroblasts, raising 
the intriguing possibility that one or more 
of these AMPK family members may me- 
diate production of the proinflammatory 
cytokine storm observed in human and 
mouse PJS polyps. 

The findings of Poffenberger et al. and 
Ollila et al. constitute an important ex- 
ample of how inflammation can be driven 
by haploinsufficiency of tumor suppressor 
genes involved in inherited familial cancer 
syndromes. Similar to LKB1 deficiency in 
PJS, heterozygous loss of other tumor sup- 
pressors can drive benign tumor growth 
containing mixed cell types, including epi- 
thelial, stromal, and immune cell compo- 
nents (J0). Examples of heterozygous loss in 
mixed-lineage tumors include loss of APC 
(adenomatous polyposis coli) in familial 
adenomatous polyposis, SMAD4 in familial 
juvenile polyposis (FJP), PTEN (phospha- 
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microenvironment in polyposis 
Heterozygous loss of LKB] expression in T cells or stromal cells is sufficient to 
induce proinflammatory cytokines, including IL-6 and IL-1, which recruit neutrophils 
and other inflammatory immune cells. This inflammatory microenvironment 

drives JAK-STATS3 signaling in stromal and epithelial cells, concurrent with epithelial 
mTORC1 activation, and is sufficient to induce polyp growth. 
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tase and tensin homolog) in Cowden’s dis- 
ease, TSC1 (tuberous sclerosis complex 1) 
and TSC2 in tuberous sclerosis, VHL (von 
Hippel-Lindau tumor suppressor) in von 
Hippel-Lindau syndrome, and NF1 (neuro- 
fibromatosis 1) in neurofibromatosis type 
1. The involvement of immune cell compo- 
nents in driving benign tumor formation as 
demonstrated by Poffenberger et al. echoes 
observations in mouse models of disor- 
ders such as FJP, in which loss of SMAD4 
in T cells up-regulates inflammatory cyto- 
kines and results in gastrointestinal polyp 
growth (11, 12), and in neurofibromatosis 
type 1, in which Nfi-heterozygous mast 
cells (a type of inflammatory immune cell) 
secrete proinflammatory cytokines into the 
nerve microenvironment that are required 
for and drive neurofibroma growth (13). 
It will be very interesting to see whether 
the involvement of a proinflammatory, de- 
regulated immune system also contributes 
to benign tumor growth in other familial 
cancer syndromes besides PJS, FJP, and 
neurofibromatosis type 1 (4, 5, 11-13) or 
in sporadic solid tumors with 
LKB1 loss (14, 15). 

The findings of Poffenberger 
et al. and Ollila et al. also sug- 
gest that JAK inhibitors may 
be potential new therapeutic 
modalities for GI and other tu- 
mors arising in PJS patients for 
which there are few treatment 
options. Future exploration of 
the involvement and therapeu- 
tic potential of targeting JAK- 
STAT3 signaling in sporadic 
tumors with LKB1 inactivation 
is now also warranted. 


REFERENCES 


1. D.B. Shackelford, R. J. Shaw, Nat. Rev. 
Cancer 9,563 (2009). 


Smooth muscle cells. 2. A.Hemminki, Cell Mol. Life Sci, 55,735 
—-— (1999). 
3. P.Katajisto et al., Nat. Genet. 40,455 
(2008). 
4. M.C.Poffenberger et al., Science 361, 
406 (2018). 
Gl epithelial cells 5. S.Ollilaetal.,J. Clin. Invest. 128,402 
(2018). 


Published by AAAS 


(mTORCI*, JAK-STAT3*) 


6. R.J. Shaw etal., Cancer Cell 6,91 (2004). 

7. T.L.Putoczkietal., Cancer Cell 24,257 
(2013). 

8. H.Yuetal., Nat. Rev. Cancer 9,798 
(2009). 

9. J.M.Lizcano et al., EMBO J. 23, 833 
(2004). 

10. K.Inoki,M.N.Corradetti, K.L. Guan, Nat. 
Genet. 37,19 (2005). 

ll. B.-G.Kimetal., Nature 441, 1015 (2006). 

12. J.N.Hahn etal.,J. Clin. Invest. 121, 4030 
(2011). 

13. F.-C. Yang et al., Cell 135, 437 (2008). 

14. S.Koyamaetal., Cancer Res. 76,999 
(2016). 

15. F.Skoulidis et al., Cancer Disc. 8,822 
(2018). 


10.1126/science.aau4804 


27 JULY 2018 * VOL 361 ISSUE 6400 333 


8LOz ‘92 Ainr uo /610° Bewaousids'90ua!0S//:djjy Woody papeojuMOG 


PEM3 NOATOTOBUJIA TPYNNA "What's News" VK.COM/WSNWS 


INSIGHTS | PERSPECTIVES 


RETROSPECTIVE 


Paul D. Boyer (1918-2018) 


Pioneer of molecular machines and inspirational leader 


By David S. Eisenberg 


n 2 June, 2 months before his 100th 

birthday, Paul Boyer died peacefully, 

surrounded by family, including Lyda, 

his wife of nearly 79 years. Paul was 

deeply admired by his students, col- 

leagues, and friends for his many tal- 
ents. He was skilled in mechanics, science, 
diplomacy, administration, and athletics. Be- 
yond his scientific success, culminating in a 
share of the 1997 Nobel Prize in Chemistry, 
Paul contributed to the community by serv- 
ing as a role model, demonstrating the wisest 
path forward in tough situations. 

Paul died at his home in Los Angeles, which 
he built, serving as his own general contrac- 
tor. Recalling Paul’s mechanical skills, his 
daughter Gail said that they had never had a 
repairman in the house. Earlier, while at the 
University of Minnesota, Paul and Lyda had 
designed their first home, for which Paul was 
the plumber, electrician, and cabinet maker. 
Paul was teaching his class at the university 
when the inspector arrived at their homesite 
to evaluate the plumbing. He lingered in- 
tently over each fitting and junction and then 
confronted Lyda with the question, “Who did 
this work?” Her heart fell as she told him it 
had been her husband. “He must have been 
trained in the old country,’ said the inspector, 
“they don’t do fine work like this anymore.” 

In fact, Paul was raised in Provo, Utah, 
the fourth of seven children, a descendant 
of what he called “hardy Mormon pioneer 
stock.” Although he rejected the religious ten- 
ets of his church, he attributed his scientific 
career to parental and community devotion 
to education. He attended Brigham Young 
University, a few blocks from his home, grad- 
uating with a degree in chemistry in 1939. 
That autumn, he left for graduate school at 
the University of Wisconsin-Madison armed 
with $150 in cash, a $400-per-year scholar- 
ship, and his new bride, Lyda. He soon devel- 
oped a second love—for biochemistry. After 
earning his Ph.D. in the subject in 1943, he 
did war research on blood proteins at Stan- 
ford University, joined the faculty at the Uni- 
versity of Minnesota, where he introduced 
chemical and isotopic methods for the study 
of enzyme mechanisms, and then settled 
for good at the University of California, Los 
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Angeles (UCLA), in 1963. Starting at age 70, 
Paul and Lyda enjoyed an active retirement 
of tennis, golf, bridge, and travel throughout 
the American West, interrupted only by the 
award of Paul’s Nobel Prize at age 79. 

Paul’s most profound discovery was that 
adenosine triphosphate (ATP), the energy 
currency of life, is synthesized in living cells by 
arotary molecular engine, ATP synthase. The 
inference of this unprecedented mechanism 
from measurements of chemical rates could 
have been made only by a scientist with Paul’s 
deep mechanical understanding. He applied 
mass spectrometry to monitor oxygen-18 
exchange kinetics to understand the en- 
zymatic mechanism of ATP synthase. His 
experiments suggested that the energy of oxi- 


dation is coupled to the release of ATP from 
the enzyme, rather than to ATP synthesis (his 
so-called “binding change mechanism”), and 
that the enzymatic reaction of ATP synthesis 
must involve more than one catalytic site. 
From these insights, he correctly proposed a 
rotary molecular engine, later gloriously illu- 
minated by the crystal structure of ATP syn- 
thase by British chemist John E. Walker. For 
this work, he and Walker shared half of the 
Nobel Prize in Chemistry in 1997. 

Paul had the curiosity and confidence to 
grapple with big questions and big projects. 
Though he studied some 20 enzymes during 
the course of his career, he kept returning 
to ATP synthase. As he said in a biographi- 
cal memoir, “Although the larger questions 
[about oxidative phosphorylation] were not 
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likely to be answered, at least I wanted to 
try.”” He knew that “effort is difficult with- 
out optimism, and accomplishment is rare 
without effort.” 

Soon after moving from Minnesota to 
UCLA, Paul applied his optimism and effort 
to founding the Molecular Biology Institute 
(MBI). He immediately set out to recruit fac- 
ulty (around 20 over time, including me) and 
students (some 400 have completed molecu- 
lar biology Ph.D.s as of 2018). With phenom- 
enal persistence, he put together a mosaic of 
funding sources for construction of the labo- 
ratory now known as Boyer Hall. However, 
not every project of Paul’s came to fruition. In 
1980, he envisioned a set of bicycle freeways 
emanating from UCLA that would change 
the character of Los Angeles. Clearly, that 
idea was before its time, but Paul could once 
again say “at least I wanted to try.” 

Active in the national leadership of the 
biochemical community, Paul served as presi- 
dent of the society now called the American 
Society for Biochemistry and Molecular Biol- 
ogy, as well as chair of the Division of Bio- 
logical Chemistry of the American Chemical 
Society. With Lyda’s assistance, he produced 
18 volumes of the review series The Enzymes. 

Paul’s many achievements were made pos- 
sible by his optimism, effort, ambition, and 
know-how, but there was something more: 
character. In my every encounter with Paul, 
from the first in 1967 to my last shortly before 
his death 50 years later, I sensed his drive 
to get at the truth, his generous judgments 
of others and their work, and his modesty. 
Together, these traits made him the most 
effective leader that many of us have ever 
personally known. Under his 18-year direc- 
torship of the MBI, faculty meetings were 
amicable and short. Paul had discussed the 
issues beforehand with each of us and had 
forged a consensus prior to the meeting. A 
generous author, Paul carefully noted the 
advances made by each of his graduate stu- 
dents and postdoctoral fellows in his several 
scientific autobiographies. In appreciation of 
these contributions, Paul donated to UCLA 
part of his Nobel Prize stipend for awards to 
his current UCLA postdoctoral fellows. Paul 
also invariably took care to credit the results 
from the labs of other scientists. Even when 
he disagreed, the tone was one of respect, 
sometimes noting where in hindsight his 
own interpretation had been wrong. 

In the present era, when science and rea- 
son are under attack, facts are considered 
optional, lying by leaders is not unexpected, 
and bullying and falsely hogging credit are 
tolerated, Paul Boyer’s life stands as a model 
of the proper path for all of us who have had 
the privilege of knowing him. 
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The anticommons at 20: 
Concerns for research continue 


Emerging trends in exclusionary rights may affect research 


By Jorge L. Contreras 


ifty years ago, Garrett Hardin fa- 
mously predicted that unrestrained 
use of a common resource could lead 
to its overuse and depletion—the so- 
called “tragedy of the commons.” 
Thirty years later, Heller and Eisen- 
berg introduced the term “anticommons” to 
the research-policy lexicon (J), postulating 
that unfettered exercise of individual prop- 
erty rights could lead to an equally tragic 
underutilization of resources. By drawing 
on analogies from land-use planning, they 
reasoned that if multiple holders of intellec- 
tual property (IP) rights, particularly pat- 
ents, covering a biomedical technology can 
individually block others from conducting 
research on that technology, then overall 
research progress could be stifled. Though 
many observers now agree 20 years later 
that empirical evidence of a patent-based 
anticommons in biomedical research is in- 
conclusive (2), if not wholly refuted (3, 4), 
there are emerging areas beyond patent law 
in which the proliferation of exclusionary 
rights may threaten research in much the 
way that Heller and Eisenberg predicted. 
Although a common resource such as a 
pasture or a river is held by multiple parties 
for the benefit and use of all, an anticom- 
mons is held not in common, but in sepa- 
rate, yet co-dependent, fragments, whereby 
any individual owner may block others’ use 
of the whole. Heller and Eisenberg were 
particularly concerned with privatization 
of “upstream” research tools—basic scien- 
tific discoveries and techniques that can be 
used to develop a range of “downstream” 
diagnostics and therapeutics—arguing that 
“each upstream patent allows its owner 
to set up another tollbooth on the road to 
product development, adding to the cost 
and slowing the pace of downstream bio- 
medical innovation” (J). For example, a 
proliferation of patents, sometimes called 
a patent thicket, covering specific protein 
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receptors or basic DNA sequencing tech- 
niques could impede research on a broad 
range of applications. Only by limiting the 
potential hold that multiple patent holders 
may have over basic research tools, they ar- 
gued, can the inefficiencies that are likely 
to emerge from anticommons be averted 
(1). They thus called on governmental ac- 
tors such as the U.S. National Institutes of 
Health (NIH) to “ensure coherent bound- 
aries of upstream patents and to minimize 
restrictive licensing practices that interfere 
with downstream product development.” 

In their conceptualization of the anticom- 
mons, Heller and Eisenberg challenged ear- 


“In the regime of the 
anticommons, it is excessive 
privatization of resources 
that hinders, rather 

than helps, the productive 
use of assets.” 


lier theoretical work [for example, (5)] that 
argued that assets, including inventions and 
other intangibles, will be put to their high- 
est and best use only if they are effected with 
private interests that incentivize their own- 
ers to manage and exploit them efficiently—a 
property-based solution to Hardin’s tragedy 
of the commons. Heller and Eisenberg’s 
work signaled a new wave of interest in the 
application of commons-based solutions to 
intellectual assets, building on the founda- 
tion laid by Elinor Ostrom and others in the 
area of tangible common pool resources. In 
addition to privatization strategies, tragedies 
of the commons can also be solved by collec- 
tive management of common assets. In the 
regime of the anticommons, it is excessive 
privatization of resources that hinders, rather 
than helps, the productive use of assets. 


THE SEARCH FOR ANTICOMMONS 


Soon after the Heller and Eisenberg article, 
researchers began to seek empirical evidence 
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of a developing anticommons in biomedical 
research. The proliferation of patents on 
individual genes and key sequencing pro- 
cesses was of particular concern. One study 
reported that U.S. patents covered approxi- 
mately 20% of known human genes (6), and 
another found that patents had a negative 
impact on the use and development of mo- 
lecular diagnostic tests (7). 

Others worried that patenting human 
DNA and other research tools could result 
in serious impediments to the develop- 
ment of multigene diagnostic panels and 
DNA chips that could simultaneously test 
for large numbers of genetic variants, as 
well as high-throughput genotyping and se- 
quencing platforms and animal models (8). 
Another study showed that genetic variants 
contractually protected by the private firm 
Celera Genomics, when compared to pub- 
lic data on comparable variants from the 
Human Genome Project (HGP), “generated 
economically and statistically significant 
reductions in subsequent scientific research 
and product development” (9). 

Findings like these set off alarm bells 
throughout the research community and 
seemed to validate the anticommons hy- 
pothesis. However, outside of genetics, stud- 
ies found that the behavior of biomedical 
researchers was not substantially affected 
by the existence of patents. In many cases, 
academic researchers simply appeared to ig- 
nore patents in their research (2). 


POOLING, AGGREGATION, COMMONS 

If we have not seen the emergence of a pro- 
nounced patent anticommons in biomedical 
research, it is worth asking why. One oft- 
raised possibility is that rational private ac- 
tors, when faced with the fragmentation and 
congestion threatened by an anticommons, 
will develop collective and mutually benefi- 
cial solutions to enable research to advance 
nevertheless. Such “pooling” approaches 
have arisen to address rights fragmentation 
in industries ranging from music to aircraft 
to radio and telecommunications (3). 

Heller and Eisenberg recognized that the 
gridlock threatened by anticommons could 
be remedied through bargaining and the 
formation of IP pools. Yet they expressed 
reservations about the general viability of 
IP-pooling solutions. They identified sev- 
eral distinct features of the biotechnology 
and pharmaceutical industries that could 
make successful bargaining over patent 
rights less successful than in fields such 
as aviation and music, including transac- 
tion costs associated with accumulating 
sufficient rights to practice biotechnology 
inventions, the heterogeneous interests of 
patent holders, and cognitive biases that 
cause biotechnology patent holders to 
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overestimate the value of their own techni- 
cal contributions (1). 

But, despite years of advocacy surround- 
ing potential patent pools for HIV/AIDS and 
severe acute respiratory syndrome (SARS) 
research and the emergence of substantial 
pooling activity in other industries, rela- 
tively few patent pools—and none of com- 
mercial importance—have formed in the 
pharmaceutical or biotechnology sectors. 
Several factors could explain the absence of 
pooling in this arena: the need for at least 
some market exclusivity in an environment 
with extremely high costs of product de- 
velopment, clinical trials, and regulatory 
approval; patent holders’ desire to retain 
control over their assets; and concerns over 
compromising commercial secrecy by col- 
laborating with others (4). 

Although the formal pooling of frag- 
mented property interests in biomedical 
research may not have materialized in a 
meaningful way, this sector is also char- 
acterized by a remarkable, and arguably 
unique, outpouring of valuable discoveries 
to the public domain. The sum of these con- 
tributions may be moving us toward a com- 
prehensive medical-information commons 
(10)—the very antithesis of the anticom- 
mons predicted by Heller and Eisenberg. 
The beginning of this trend is often traced 
to the 1996 Bermuda accord reached by 
leaders of the HGP, which required all HGP 
research groups to deposit their sequence 
data into public databases 24 hours after 
being generated. The ethos of rapid, pub- 
lic release of genomic and related data has 
now become the norm in many fields of bio- 
medical research and has even expanded to 
the commercial sector (JZ). Voluntary data 
sharing by research institutions and corpo- 
rations has also emerged in areas such as 
testing for mutations in the breast cancer 
gene BRCA, largely in response to the pro- 
prietary data approaches taken by firms 
such as Myriad Genetics (11, 12). 

Thus, just as collective action by affected 
stakeholders has been shown by Ostrom 
and others to avert tragedies of the com- 
mons involving scarce shared resources, the 
collective action of the biomedical research 
community—governments, institutions, and 
individual researchers—developed a power- 
ful response to the potential fragmentation 
and propertization of the research environ- 
ment (17). It is likely that the success of 
these public research commons has con- 
tributed to the dearth of observed anticom- 
mons effects. 

There may be other reasons that patent 
anticommons did not take hold in the bio- 
medical sciences to the degree envisioned by 
Heller and Eisenberg. For example, in line 
with their recommendation that upstream 
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research tools be licensed broadly and non- 
exclusively, the NIH adopted a policy in 1999 
urging its grant recipients to license patented 
research tools on a nonexclusive basis to 
promote their greatest utilization, commer- 
cialization, and public availability. In 2007, a 
group of 11 major U.S. research universities 
followed suit and committed, in a set of core 
principles known as the “Nine Points,” that 
research tools should be made as broadly 
available as possible. Today, more than 100 
research institutions around the world have 
voluntarily subscribed to the Nine Points. 


“it remains the case 
that the combination of 
extensive propertization 
with fragmentation 

of ownership can lead to 
transactional gridlock 
and underutilization of 
socially valuable assets.” 


Finally, over the past decade, patent 
laws in a number of countries have be- 
come weaker, not stronger, with respect 
to the protection of upstream biomedical 
innovations. Beginning in 2010, a series of 
U.S. Supreme Court decisions clarified that 
“products of nature,” “mental processes,” 
and “abstract ideas” are not eligible for pat- 
ent protection. As a result, it has become 
increasingly difficult to patent basic bio- 
medical discoveries in the United States, 
so much so that some have begun to ask 
whether the lack of patents in certain ar- 


Published by AAAS 


al 


eas (for example, molecular diagnostics and 
personalized medicine) will itself impede 
future innovation and discovery (13). 


ANTICOMMONS ON THE HORIZON? 

Despite the absence of a serious patent an- 
ticommons in biomedical research today, 
Heller and Eisenberg’s warning should not 
be ignored. There are several areas beyond 
patent law in which the proliferation of ex- 
clusionary rights could impede biomedical 
research and product development in ways 
that are similar to those that Heller and 
Eisenberg envisioned. 

First, in the vacuum left by limitations 
on patenting human DNA, some firms have 
increasingly turned to trade-secret law to 
protect data that they collect from patients 
and test subjects (72). In most countries, 
trade-secret law gives an enforceable prop- 
erty-like right to the holder of commercially 
valuable information that is deemed to be 
confidential. And unlike patents, which 
expire after 20 years, trade-secret protec- 
tion continues in perpetuity, so long as the 
relevant information remains secret. Trade 
secrecy also challenges research in ways dif- 
ferent from patents. For example, a patent 
is an official document that publicly dis- 
closes the patented invention, thus enabling 
others to study and improve on its features 
and techniques. Trade secrets, by their na- 
ture, need never be disclosed to the world, 
thus limiting the opportunity for follow-on 
research and improvements. 

If more data are treated as secret by re- 
searchers, there may be less overall growth 
in knowledge and the medical information 
commons may not grow as anticipated 
(10). In addition, the withholding of data 
by individual researchers may give rise 
to anticommons effects. In fields that are 
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characterized by large bodies of interde- 
pendent observations, the withholding of 
individual research results may stymie the 
development of a full understanding of 
the field. This fragmentation effect is par- 
ticularly salient in the area of molecular 
diagnostic testing, in which different test- 
ing labs may each collect genetic variant 
data from patients, but the true potential 
of these data will only be realized if they 
are combined and analyzed together. The 
inability of researchers to conduct cross- 
cutting analyses could result in less accu- 
rate diagnostics and fewer therapeutics. 
Thus, although holding data privately does 
not have the same exclusionary effects 
as obtaining a patent (that is, others are 
not precluded from independently gener- 
ating the same data), the difficulties that 
emerge in consolidating data from differ- 
ent sources, coupled with the reduction in 
overall welfare arising from a lack of the 
full spectrum of results, fall close to Heller 
and Eisenberg’s anticommons. 

Eisenberg anticipated this issue in 2008, 
observing that with “practically exclud- 
able” resources such as data and biological 
samples (that is, where exclusion is based 
not on legal restrictions but instead on the 
need for one researcher to provide the re- 
source to others), it is up to users to per- 
suade owners to permit access (2). As such, 
a data-driven anticommons could emerge if 
enough researchers keep their data propri- 
etary and share them only on condition of 
payment or not at all. There are few effec- 
tive regulatory solutions to this issue under 
current law, but the continued encourage- 
ment (or requirement) of data sharing by 
research funders, health care payors, and 
leading journals could avert the worst ef- 
fects of such an anticommons. 
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Second, despite the recent tightening of 
legal requirements for patent eligibility, pat- 
enting activity continues around the world 
with respect to emerging biomedical tech- 
nologies such as CRISPR (clustered regu- 
larly interspaced short palindromic repeats) 
gene editing. The potential exists for sub- 
stantial fragmentation of the CRISPR pat- 
ent landscape—not through disaggregated 
ownership of CRISPR patents themselves 
(though that is also possible) but through 
parcelization of the CRISPR-patent estate via 
narrowly drawn licensing agreements con- 
trolled by a handful of private “surrogate” 
companies empowered by the academic re- 
search institutions that made foundational 
CRISPR discoveries (14). These companies 
may have incentives to license technology to 
others on a limited gene-by-gene, indication- 
by-indication basis that does not encourage 
the full breadth of potential research and 
product development. And although some 
CRISPR patent holders have granted favor- 
able rights to academic researchers, this so- 
lution is neither universal nor binding nor 
permanent. As a result, policy-makers should 
continue to remind academic research insti- 
tutions of their public commitments to work 
toward wide dissemination of the fruits of 
their research and to discourage the exclu- 
sive licensing of broadly applicable research 
tools. If such reminders prove to be insuffi- 
cient, research-funding agencies could more 
definitively require the broad availability of 
funded research through licensing. 

Finally, a new and potentially important 
form of fragmentation has recently emerged 
with respect to individual health informa- 
tion. A spate of recent legal disputes in the 
United States has led to increasing calls for 
personal ownership of genetic and other 
health information (15). The rationales for 
this privatization movement are varied, rang- 
ing from concerns over individual autonomy, 
privacy, and dignity, to offering a more palat- 
able alternative than corporate ownership, to 
creating a basis for data-based market trans- 
actions, and to dissatisfaction with existing 
regulatory and administrative data protec- 
tion frameworks in the United States and 
the European Union. Yet, despite the good 
intentions behind many of these proposals, 
granting individuals an enforceable property 
interest in information about themselves, in- 
cluding the right to receive compensation for 
its use, could pose considerable impediments 
to data-driven research, particularly in the 
coming era of megacohort studies involving 
a million or more individuals. 

Thus, although Heller and Eisenberg wor- 
ried that fragmented interests held by a few 
dozen or hundred patent owners could im- 
pede biomedical research, the possibility that 
millions of individual data subjects could 
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demand clearance, oversight, or payment 
to use their data in successive studies, or 
withdraw their data from existing databases 
and ongoing analyses, has far more dramatic 
ramifications for research (J5). To avert this 
version of the anticommons, policy-makers 
and courts should continue to resist calls for 
individual data ownership and look instead 
to meaningful regulatory and legal measures 
to protect individuals against invasions of 
privacy and abuse by researchers. 


BROADER LESSONS 

Though the patent-driven biomedical anti- 
commons envisioned by Heller and Eisen- 
berg does not appear to have emerged widely, 
researchers and policy-makers must remain 
vigilant as new sources of potential anticom- 
mons emerge. Admittedly, the precise param- 
eters of exclusivity generated by new forms of 
property fragmentation vary, and they do not 
all exhibit the same features, or lend them- 
selves to the same solutions, as patent-based 
anticommons. Nevertheless, it remains the 
case that the combination of extensive prop- 
ertization with fragmentation of ownership 
can lead to transactional gridlock and under- 
utilization of socially valuable assets. 

Thus, it is worth looking beyond bio- 
technology patents to the more general dis- 
tinction between fragmented and common 
ownership models. Most importantly, no 
matter how unobjectionable the legal and 
commercial rationales underlying properti- 
zation strategies may first appear, attention 
should be given to the downstream effects 
that potential anticommons could cause. 
If serious impediments to socially valuable 
activity, including biomedical research, are 
likely to result, then policies limiting the 
impact of such anticommons should be con- 
sidered and weighed in the balance. 
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Nature's riverkeepers 


Recognizing their role in maintaining healthy watersheds, 
“beaver believers” work to rehab the rodent’s reputation 


By Sarah Boon 


hy should we care about beavers? 

Consider all they do. Beavers 

convert vegetation to marsh to 

wetland and back again. They 

facilitate water storage in ponds 

and recharge groundwater. Ponds 

and meadows sculpted by beavers concen- 

trate nutrients such as nitrogen 

and phosphorus. Not only does 

this create fertile ground, it 

helps filter agricultural runoff. 

Beaver-dammed landscapes cre- 

ate habitats for other species, 

and their complexes can serve 
as wildfire breaks. 

Researchers have calculated 


Fager 


BEAVERS 
MATTER 


a 


the 1920s and 1930s, but never to their 
original numbers. Eager, by Ben Goldfarb, 
examines how they’e getting along today. 
Beavers still face obstacles when we at- 
tempt to reintroduce them into ecosystems 
in which they once flourished. Predators can 
eat a beaver for lunch, while cattle grazing 
removes vegetation and can alter a stream’s 
configuration in such a way that it can no lon- 
ger support beaver populations. 
Even wild ungulates can be a 
detriment to beavers. Goldfarb 
describes how the reintroduc- 
tion of wolves into Yellowstone 
National Park led to the recovery 
of vegetation and rerouting of 
stream channels, which some re- 
searchers maintain was the result 


that between 15 million and 250 en of an increase in the availability 
million beaver ponds once cCOV- — ChelseaGreen, 2018, Of streamside willows and alder 
ered North America. A boom- 302 pp. for beavers. But one of the main 


ing 19th-century trade in beaver 
pelts, along with a loss of habitat due to 
settlement, almost wiped them out. The 
landscape they helped to shape was a wa- 
tery quagmire—and a classic example of 
shifting baselines. Because we didn’t notice 
the initial profusion of beavers in the wild, 
we think current populations are normal. 
Beaver populations recovered through 
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barriers to beaver reintroduction 
is policy, especially in states such as New 
Mexico, where legislators fear beaver activity 
will harm the cattle industry. 

Eager also highlights the problems of 
preconceived assumptions about beavers 
and beaver management. A common sup- 
position among fish ecologists, for example, 
is that beavers are bad for salmon because 
their dams prevent the fish from swim- 
ming upstream. However, Goldfarb cites a 
comprehensive review of 108 papers that 
showed that beavers benefit fish popula- 
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Some researchers believe that beavers were critical 
to promoting ecosystem recovery in Yellowstone. 


tions more often than they cause negative 
consequences. A more egregious error oc- 
curred in California in 1937, when ecologist 
Joseph Grinnell declared that beavers had 
never inhabited large portions of the state. 
Although this was untrue, it went unchal- 
lenged and affected beaver recovery and 
ecosystem management across the state un- 
til it was disproven in 2012. 

With drought in the southwest United 
States, there’s talk of building new dams. 
Beavers could do that for us—and create a 
fully functioning ecosystem while they're 
at it. Goldfarb visits a property in Nevada, 
where a local rancher was able to extend his 
water availability by two months by allowing 
beavers to colonize one of his watersheds. 

Later in the book, Goldfarb suggests that 
we could combine the heavy-duty approach 
of river restoration (such as use of backhoes 
and rip rap) with the work of beavers by 
installing beaver dam analogs (BDAs): two 
posts hammered into the ground with twigs 
woven between them (Science, 8 June, p. 
1058). The BDAs, he argues, would speed up 
ecological recovery by encouraging beavers 
to colonize designated watersheds. 

Goldfarb speaks largely with “beaver be- 
lievers’”—individuals who try to help humans 
and beavers coexist by mitigating the impact 
of beavers on the built landscape and by rein- 
troducing them into stream systems that they 
can potentially restore. He lets his interview- 
ees tell the majority of the story, recalling, for 
example, Councilman Mark Ross’s interac- 
tion with a local businessman during a tense 
meeting of pro- and anti-beaver groups in 
Martinez, California (“This seventy-year-old 
guy is about to hit me! ...Do I hit back against 
a senior citizen or not?’”) and Yellowstone sci- 
entist Dan Kotter’s description of the effects 
of bison grazing on riparian vegetation (“like 
Jabba the Hutt eating a piece of pizza’”). 

Goldfarb ends the book with a trip to the 
United Kingdom, where beavers haven’t been 
seen since the 17th (Scotland) and late 18th 
(England) centuries. Here, reintroduced 
beavers are a huge tourist draw, and beaver 
dams reduce the impacts of flooding—a big 
problem in the UK—although many farmers 
aren’t convinced. As Goldfarb writes, “Every- 
one shares a goal; no one agrees on strategy.” 

One thing Eager was missing was a visit to 
Canada. The beaver is the country’s national 
animal and graces its nickel. It has acres of 
landscape shaped by beavers, and Goldfarb 
cites a number of Canadian studies, but 
a firsthand experience would surely have 
enriched his otherwise excellent story. 


10.1126/science.aat7938 
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SOCIAL SCIENCE 


In vivo we trust 


An engaging synthesis highlights the value of 
field experiments in the social sciences 


By John A. List 


ocial scientists once restricted their 

research to carefully controlled labo- 

ratory experiments. Over the past 25 

years, however, they have increas- 

ingly made use of field experiments. 

Insights gained have spanned nearly 
every imaginable segment of our society, 
lending tests of theory, advice to policy- 
makers, and guidance to nonprofit and for- 
profit firms alike. 

In his new book, Randomistas, Andrew 
Leigh takes stock of a slice of this research 
in an even-tempered, scientific, and acces- 
sible way. The work reads like a stroll down 
memory lane, as Leigh digs into historically 
rich areas of research, ranging from the ed- 
ucation production function to crime pre- 
vention and useful poverty interventions. 
Yet perhaps the most exciting aspect of the 
book concerns field experiments in politics 
and philanthropy. 

Politics is interesting in its own right. 
Why people vote and how we can enhance 
voter participation remain first-order ques- 
tions in well-functioning democracies. In 
chapter 9, Leigh cleverly illustrates how 
Barack Obama made keen use of field ex- 
periments to figure out what was working 
and why during his first presidential cam- 
paign. For instance, in 2007 Obama’s team 
used a field experiment to determine the 
best image and slogan to use to motivate 
web visitors to subscribe to future cam- 
paign emails. The version that yielded the 
most email addresses—a black and white 
photo with the message “Learn More”—sur- 
prised even the most seasoned experts, who 
expected that a video accompanied by the 
message “Sign Up” would be the top per- 
former, and served as a scientific basis for a 
successful campaign. 

Later, Leigh describes how political sci- 
entists Alan Gerber and Donald Green 
determine the efficacy of get-out-the-vote 
interventions. As Leigh, a politician him- 
self, points out: “I’ve met ‘experts’ who are 
convinced that partisan letters work best 
when paired with doorknocking, that tele- 
phone calls work best in the final week of 
the campaign, or that posters outside the 
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election booth make a huge difference. But 
ask them about their evidence base and it’s 
quickly apparent that their war stories lack 
a control group.” 

In their most effective intervention, Ger- 
ber and Green show that a letter revealing 
the recipient’s turnout record, as well as 
their neighbor’s, increases turnout by 8 per- 
centage points. This remarkable effect likely 
reflects social-image concerns. 

Charitable giving, meanwhile, is much 
more important than most people realize. 
The number of U.S. nonprofits registered 
with the Internal Revenue Service grew by 
nearly 60% from 1995 to 2005, and charita- 
ble gifts of money have more than doubled 
since 1990, now exceeding 2% of GDP. 

The market for charitable giving primar- 
ily revolves around three major players: 
donors, who provide the resources to chari- 
ties; charitable organizations, which de- 
velop strategies; and the government, which 
decides (among other issues) the tax treat- 
ment of individual contributions, the level of 
government grants given to various charities, 
and what public goods to provide itself. 

Leigh catalogs interesting details of 
recent field experiments, which lend in- 
sights into all three actors, but he focuses 
most of his efforts on the relationships be- 
tween charities and individuals. Here, he 
summarizes some of my own field experi- 
ments from the 1990s, which showed the 
importance of a “lead donor” and match- 


Randomistas 

How Radical Researchers 
Are Changing Our World 
Andrew Leigh 

Yale University Press, 
2018. 283 pp. 


ing funds in raising charitable giving. (The 
mere mention of a lead donor raised giving 
by 50 to 100%.) 

What is particularly appealing about 
Randomistas is that it does not stop at dis- 
cussing the overarching literatures associ- 
ated with the topics Leigh chooses to focus 
on. It also details how to build a better feed- 
back loop and how organizations can get 
over hurdles that prevent them from run- 
ning effective interventions. These include 
fairness concerns, replication needs, and 
the discomfort associated with admitting 
when you do not know something. (The lat- 
ter is a challenging one for most managers 
I know.) 

I do not have any qualms with what was 
written in this book, but I do feel that an 
important element of the field experiment 
equation was omitted. In the past few de- 
cades social scientists have done a superb 
job of developing methods with which to 
generate field data showing how the world 
works and detailing intervention effects. 
However, how we should use the data for 
policy purposes is often neglected. Do the 
results scale to a larger setting? What are 
the factors that affect that scaling? Without 
this information, empirical research can 
be quickly undermined in the eyes of the 
policy-maker, the broader public, and even 
within the scientific community. & 
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PODCAST 


The Secret Token 
Myth, Obsession, and the Search 
for the Lost Colony of Roanoke 


Andrew Lawler 
Doubleday, 2018. 410 pp. 


Delayed for 3 years by an ongoing 
war, John White returned to the 
American colony of Roanoke on 18 
August 1590 but found the settlement 
deserted. The only clue was the word 
“Croatoan” carved into a fence post. 
This week on the Science podcast, 
Andrew Lawler recounts the mystery 
of the Lost Colony and shares stories 
of those who've devoted their lives to 
finding it. 


Bluffs at Albemarle Sound are rapidly eroding into the ocean, 
complicating efforts to find clues about the Lost Colony. 
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Brazilian killifishes 
risk extinction 


The annual killifishes are a diversified 
group of small fish in Africa and South 
America (J). Annual killifish are unique 

in their ability to survive in ephemeral 
pools; they have short, seasonal life cycles, 
laying eggs that lie dormant in the soil 
once the pools have evaporated and then 
resume development and hatch when the 
water returns (2, 3). In Brazil, these fish 
are found in all biomes and have been 
recorded in the most variable seasonal 
freshwater environments (2). As a result 

of habitat loss, restricted distribution, and 
low dispersal of most species, annual kil- 
lifishes face strong threats of extinction (4, 
5); they are the most endangered group of 
fish in Brazil (5, 6). In a 2014 evaluation of 
endangered Brazilian fauna, according to 
the International Union for Conservation 
of Nature (IUCN) criteria (5), 102 spe- 

cies of annual killifishes were considered 
endangered, representing one-third of all 
threatened freshwater fish in Brazil. Some 
of these species are no longer found in 
nature and possibly extinct (7). 

Several factors are responsible for the 
high degree of threat to killifish. The use 
of land for urbanization, agriculture, and 
hydroelectric dams has led to the loss and 
degradation of the pools in which the fish 
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live (5—7). Many environmental agencies 
lack knowledge about the species and 
their habitats and sometimes have been 
negligent in their efforts to protect them 
(8). Because few killifish species live in 
protected areas, many of Brazil’s conserva- 
tion efforts have not benefited them (8). 
Consequently, annual killifish populations 
have declined rapidly in Brazil (7, 8). 

The problem is even greater because 
Brazil’s policies hamper biodiversity 
conservation (9). The Brazilian Forest Act 
reduced the area of permanent preserva- 
tion and overlooked ephemeral aquatic 
environments (9). Other policies, such as 
the New Law on Biodiversity, make access 
to biodiversity difficult and bureaucratic, 
thereby weakening universities and 
research institutions (J0). Threats also 
include assaults on the environmental 
licensing system and disenfranchise- 
ment of environmental agencies (77). In 
a country where charismatic species are 
constantly suffering from population 
decline (12) and conservation research is 
under attack, there is little energy or atten- 
tion left for small fish that inhabit extreme 
environments and are unknown to society. 

Despite a national conservation action 
plan that has led to some small advances 
and greater visibility of killifishes in recent 
years (4), there is still much that needs to 
be done to mitigate these serious threats 
to Brazilian killifish biodiversity. Brazil 
must pass laws to protect temporary 
wetlands and facilitate public and private 
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Negligence and habitat 
loss are among the threats 
that have rendered annual 

killifishes the most 
endangered fish in Brazil. 


investments in scientific research and envi- 
ronmental education. Most urgently, the 
government must act to restore degraded 
areas and establish protected areas to save 
these species from extinction in Brazil. 
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“Deadly mosquito” 
or “living freshwater”? 


Organizations fighting disease often use 
catchy taglines such as “Which animal kills 
most people?,” as the Gates Foundation 
did at its 2018 annual campaign against 
malaria (1). These questions pique our curi- 
osity and anxiety and remind us that we 
can prevent disease by protecting ourselves 
from mosquitoes. However, casting these 
aquatic insects as villains could set back 
freshwater and invertebrate conservation. 
We, therefore, call for disease-focused 
organizations to adopt a message that puts 
human health more clearly in its environ- 
mental context. 

Freshwater is life’s and humankind’s 
most critical resource, and shifts in pre- 
cipitation and evaporation due to climate 
change directly affect aquatic habitats (2). 
As a result, it is also the environment with 
the highest proportion of threatened species 
(3). Invertebrates make up the vast majority 
of animal species, and insect populations 
are rapidly declining (4). Nonetheless, pro- 
tecting this biodiversity is often an uphill 
battle, given that insects do not evoke the 
appreciation that benefits other ecosystems 
and species, such as forests and whales. 
Blood-sucking, disease-bearing insects are 
understandably unpopular, but exploiting 
our fear of them perpetuates the prejudice 
against their habitats. 

Moreover, vilifying insects may detract 
from humans’ own role in promoting 
mosquito-borne disease. Just a fraction 
of the 3500 known species of mosquitoes 
transmit the pathogens truly responsible 
for malaria, dengue, chikungunya, and 
other human diseases (5). These vectors 
have adapted to feed primarily on humans 
by developing in human-disturbed habitat 
that is often inhospitable to their natural 
predators. For example, we litter our yards 
with the rainwater-retaining plastic in 
which some of the most problematic mos- 
quitoes thrive (6). 

We encourage disease-focused orga- 
nizations to work with ecologists and 
conservationists to expand their campaigns 
with the many positive messages that a 
healthy environment carries. Plastic-free 
yards would reduce the need for harmful 
pesticides, for instance (6). Wetlands are 
also home to attractive emissaries that eat 
mosquitoes, such as colorful dragonflies. 
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The ultimate symbol of health, of course, is 
clean freshwater itself, undeniably human- 
ity’s most precious resource. 
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Where there is fire, 
there is smoke 


In his In Depth News story “Scientists aim 
to smoke out wildfire impacts” (1 June, 

p. 948), W. Cornwall discusses wildfire- 
generated smoke and the associated large 
quantities of particulate matter. We agree 
that research on smoke demands more 
sustained scientific attention, particularly 
given its impacts on human communities 
[e.g., (1)]. However, not all fires are created 
equal in terms of their (natural or deliber- 
ate) origins, the amount of biomass they 
consume, and the amount of smoke they 
generate as a result. 

High-intensity wildfires (those exceeding 
200,000 kW/m?) in very high biomass for- 
ests may consume 9 to 14% of the biomass 
(burning green parts of living trees but 
often leaving dense wood) or 40 to 58 tons 
per hectare (2). This is substantially less 
than half of the 140 to 450 tons of biomass 
per hectare consumed when logging slash 
(i.e., dense waste wood) is burned after 
logging operations in these same kinds of 
forest (3, 4). 

Moreover, such logging burns consume 
at least 10 times as much biomass as 
hazard reduction burns designed to reduce 
wildfire risk (5). The smoke generated from 
logging therefore represents a substantial 
form of industrial pollution, but it is not 
treated as such. Beyond improving efforts 
to quantify the chemical composition 
of smoke and its long-term impacts on 
human health, there is an urgent need to 
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examine the problem in a more holistic 
manner. This demands far more careful 
consideration of the origins of smoke, the 
relative amounts of smoke originating from 
different kinds of fires, and the forest and 
land use policies that generate smoke (and 
might reduce it). 
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Comment on “An excess of massive stars 
in the local 30 Doradus starburst” 


Will M. Farr and Ilya Mandel 

Schneider et a/. (Reports, 5 January 2018, p. 
69) used an ad hoc statistical method in their 
calculation of the stellar initial mass function. 
Adopting an improved approach, we reana- 
lyze their data and determine a power-law 
exponent of 2.05*94 Alternative assumptions 
regarding dataset completeness and the star 
formation history model can shift the inferred 
exponent to 2.11°918 and 2.15*045, respectively. 
Full text: dx.doi.org/10.1126/science.aat6506 


Response to Comment on “An excess of mas- 
sive stars in the local 30 Doradus starburst” 


F. R. N. Schneider, H. Sana, C. J. Evans, 

J. M. Bestenlehner, N. Castro, L. Fossati, 

G. Grafener, N. Langer, O. H. Ramirez- 
Agudelo, C. Sabin-Sanjulian, S. Sim6én-Diaz, 
F. Tramper, P. A. Crowther, A. de Koter, S. E. 
de Mink, P. L. Dufton, M. Garcia, M. Gieles, 
V. Hénault-Brunet, A. Herrero, R. G. Izzard, 
V. Kalari, D. J. Lennon, J. Maiz Apellaniz, 
N. Markova, F. Najarro, Ph. Podsiadlowski, 
J. Puls, W. D. Taylor, J. Th. van Loon, J. S. 
Vink, C. Norman 

Farr and Mandel reanalyze our data, finding 
initial mass function slopes for high-mass 
stars in 30 Doradus that agree with our 
results. However, their reanalysis appears 

to underpredict the observed number of 
massive stars. Their technique results 

in more precise slopes than in our work, 
strengthening our conclusion that there 

is an excess of massive stars (>30 solar 
masses) in 30 Doradus. 

Full text: dx.doi.org/10.1126/science.aat7032 
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Comment on “An excess of 
massive stars in the local 30 


Doradus starburst” 


Will M. Farr’?* and Iya Mandel”** 


Schneider et al. (Reports, 5 January 2018, p. 69) used an ad hoc statistical method in their 
calculation of the stellar initial mass function. Adopting an improved approach, we 


0.14 


reanalyze their data and determine a power-law exponent of 2.05'573. Alternative 
assumptions regarding dataset completeness and the star formation history model can 


0.19 


shift the inferred exponent to 2.11°535 and B50, respectively. 


chneider et al. (1) use spectroscopic ob- 

servations of young massive stars in the 

30 Doradus region of the Large Magellanic 

Cloud to infer a shallower-than-expected 

stellar initial mass function (IMF) with a 
power-law exponent of a = 1.90°337, in con- 
trast to the Salpeter exponent of 2.35 (2). They 
estimate the ages and masses of individual stars 
with the BONNSAI Bayesian code (3), then ob- 
tain an overall mass distribution by effectively 
adding together the posterior probability den- 
sity functions of individual stars. There is no 
statistical meaning to a distribution obtained 
in this way, which does not represent the pos- 
terior probability density function on the mass 
distribution. 

Hierarchical Bayesian inference provides the 
statistically justified solution to this problem (4). 
Mandel (5) specifically considered inference on 
a mass distribution given a sample of uncertain 
measurements, and we use a similar method- 
ology here. We interpret the Schneider et al. 
inference on individual masses and ages as in- 


Fig. 1. The posterior on the IMF 

power-law exponent a is inferred 5 
from the observations d. See text 

for details on the four models. Blue: 

Schneider et al. [S+ (1)] stellar 4 
lifetimes (I.t.), survey completeness 
for M = 15Mo, and Gaussian star 
formation history model; green, 

same but with our lifetime fit; orange, 
same as green but with completeness 
for M = 20M.; pink, same as blue 2 
but with a double-exponential 

star formation history model. The 

Salpeter power-law exponent is 1 
-a = —2.35 (2), indicated by a vertical 

black line. The 68.3% range of power- 

law exponents derived by Schneider et al. 0 
is shaded in gray. 


p(a | d) 
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dependent Gaussian likelihoods for the logarithm 
of the mass and the age, with parameters fixed 
by matching the mean parameter to the peak 
and the standard deviation parameter to the 
68% width of the individual stellar distribu- 
tions in the Schneider et al. data. 

For our fiducial analysis, we model the star 
formation history as a truncated Gaussian dis- 
tribution, and generally find a star formation 
history similar to that in Schneider e¢ al., with 
the star formation rate in 30 Doradus peaking 
about 4 million years ago. We impose broad 
priors on the power-law exponent and the mean 
and standard deviation of the star formation 
Gaussian. We use the Hamiltonian Monte Carlo 
sampler STAN (6) to efficiently address the high- 
dimensional hierarchical problem with free pa- 
rameters for each star’s actual mass and age in 
addition to the IMF exponent and the mean and 
standard deviation of the star formation history. 

Figure 1 shows the inferred power-law expo- 
nent of the IMF. We use the Schneider e¢ ai. fit 
to stellar lifetimes and assume that their dataset 


S+1.t.,15Mo.,norm SFR 
1 ourl.t, 15M .,norm SFR 

our I.t., 20M, norm SFR 

S+ 1t.,15Mo_, exp SFR 


1.50 1.75 2.00 2.25 2.50 2.75 
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is complete above 15 solar masses (M,); that is, 
we select only those stars whose observed mass 
is above 15M, (7, 8). We find an exponent of 
a = 2.05343 where the quoted value corre- 
sponds to the median of the posterior distribution 
and the range to the 16th and 84th percentiles 
(i.e., the symmetric 68% credible interval). Hier- 
archical Bayesian modeling steepens the preferred 
IMF slope; our median a value lies about 1o above 
the preferred value from Schneider e¢ al. This 
analysis narrows the uncertainty interval by more 
than a factor of 2. 

The analysis above uses the same assump- 
tions as Schneider et al. Below, we consider the 
impact of three additional assumptions: the 
stellar lifetime fit, the choice of the complete- 
ness limit, and the model for the star formation 
history. 

We performed an independent fit to the main- 
sequence lifetimes tys of nonrotating massive 
stars of mass M as modeled by Brott et al. (9) 
and Kohler et al. (10): 


TS (mM) M 
In—— = 9.1 — 3.8955 In — 
n Myr 9.1973 — 3.89 Me =I 
2 3 
0.6107 (1m a — 0.0332 (in im) (1) 
Mo Mo 


Following Schneider et al., we increased the 
“observable” lifetime of a star by 10% beyond its 
main-sequence lifetime to account for helium 
burning. We find that this alternative fit does 
not affect the inferred IMF, yielding the same 
power-law exponent o = 2.057014, 

The inferred power-law exponent is somewhat 
sensitive to the choice of the cutoff mass for sur- 
vey completeness. The data of Schneider e¢ ai. 
show a relative scarcity of stars between 15M> 
and 20M,; changing the mass cutoff from 15M> 
to 20M, further steepens the inferred exponent 
toa = 2.11'01?. However, these fluctuations are 
within the expected statistical variation based 
on the sample size, as confirmed with pos- 
terior predictive checking. In particular, there 
is no statistical evidence against the claim of 
Schneider e¢ al. that the survey is complete for 
M = 15Mo. 

Finally, we considered an alternative star for- 
mation history model—a double exponential with 
three free parameters: the time of the peak of the 
star formation rate, and the (possibly different) 
decay constants before and after the peak. This 
model allows for a sharper peak and longer tails 
than a Gaussian. This star formation rate history 
model is consistent with the data, as tested with 
posterior predictive checking (see below). How- 


e . 0.13 
ever, it yields a power-law exponent o = 2.15 oa 
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Fig. 2. Posterior predictive checking demonstrates that the observed data are consistent 
with being drawn from our model. The observed distribution of (maximum likelihood) masses 

M (A) and ages t (B) are shown as black curves; distributions of mass and age from synthetic data 
drawn from our fitted model (i.e., the posterior predictive distribution) are shown as blue curves 
(median) and shaded blue regions (68% and 95% credible intervals), respectively. 


almost lo steeper than for our fiducial analysis. 
This indicates that the inferred IMF is sensitive to 
the systematics of the assumed star formation 
history model. 

We also considered the possibility that the IMF 
power law has an additional break at higher 
masses, allowing for three free parameters: the 
mass at which the break happens, and the ex- 
ponents below and above the break. However, 
we find that the data do not constrain the pa- 
rameters of this more general model, and there 
is no statistical preference for a broken power- 
law model. 

We confirmed the stability of our conclusions 
with posterior predictive checking. Figure 2 
shows the distribution of observed masses and 
ages (i.e., the peak of the likelihood) from the 
Schneider et al. data overlain on the range of 


Farr and Mandel, Science 361, eaat6506 (2018) 


mass and age distributions that would be ob- 
served from a large number of datasets drawn 
according to our fitted fiducial IMF model. The 
data are consistent with being drawn from our 
model. We have also confirmed that all of our 
models yield predictions for the numbers of 
stars heavier than 30M, and 60M, that are 
consistent with observations. 

We find that we can substantially reduce the 
statistical uncertainty in the IMF by applying an 
improved statistical analysis to the observations 
of young massive stars in 30 Doradus. However, 
the systematics from modeling uncertainties, 
such as the assumed star formation history 
model, can potentially shift the inferred power- 
law exponent by more than the statistical un- 
certainty. Furthermore, we adopted the mass 
and age posteriors for individual stars directly 
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from Schneider et al. Imperfect stellar models 
or the inclusion of other complicating factors 
described by Schneider et al. (rotation, mass 
transfer, mergers, etc.) introduce further system- 
atic uncertainty that could again shift the inferred 
IMF exponent. The combination of these factors 
makes it very challenging to infer the precise 
shape of the IMF even when a dataset as good 
as that obtained by Schneider e¢ ai. is available. 
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Response to Comment on “An excess 


of massive stars in 


the local 30 


Doradus starburst” 


Fabian R. N. Schneider’, Hugues Sana”, Christopher J. Evans”, 

Joachim M. Bestenlehner*’*, Norberto Castro®, Luca Fossati’, Gétz Grafener®, 
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Farr and Mandel reanalyze our data, finding initial mass function slopes for high-mass stars 

in 30 Doradus that agree with our results. However, their reanalysis appears to underpredict the 
observed number of massive stars. Their technique results in more precise slopes than in 

our work, strengthening our conclusion that there is an excess of massive stars (>30 solar 


masses) in 30 Doradus. 


arr and Mandel (/) reanalyzed the results 

of our study (2), in which we investigated 

the star formation history (SFH) and stellar 

initial mass function (IMF) of the local 30 

Doradus (30 Dor) starburst in the Large 
Magellanic Cloud and found an overabundance 
of stars with initial mass exceeding 30 solar 
masses (M.). They use an alternative and po- 
tentially more powerful statistical framework, 
hierarchical Bayesian inference, and infer IMF 
power-law indices for massive stars that are in 
agreement with our results (compare the IMF 
slope distributions in their figure 1 to the 1o range 
inferred in our analysis). Their analysis allows 
them to infer the IMF slope with higher pre- 
cision than was possible in our case, such that 
their inferred IMF slope for high-mass stars in 
30 Dor is shallower than that of a Salpeter IMF 
(3) with an even larger confidence (more than 
95.5%, versus 83% in our analysis). Their re- 
analysis therefore supports our main findings 
and conclusions about the IMF in 30 Dor. 


Farr and Mandel’s main criticism of our work 
is that “[tJhere is no statistical meaning to [age 
and mass] distribution[s] obtained” by adding 
the posterior probability distributions of the 
ages and initial masses inferred for individual 
stars. It is true that such distributions are not 
posterior probability functions in a Bayesian 
framework. However, we caution that the IMF 
is historically defined as a histogram of stellar 
masses (3-9) and our procedure to add the pos- 
terior probability distributions of the initial masses 
of individual stars is the equivalent of computing a 
histogram for the mass distribution of a sample of 
stars, while taking into account the observational 
uncertainties of individual mass estimates. Virtu- 
ally all IMFs inferred in the literature are 
constructed in this way, so Farr and Mandel’s 
criticism implicitly applies to those as well. The 
VLT-FLAMES Tarantula Survey (VFTS) (10) has 
reached a completeness of about 73% with respect 
to amore complete census (7) of massive stars in 
30 Dor (see figure S2 in our original work). For a 


complete stellar sample, the age distribution of 
stars obtained with our method would directly 
provide the SFH at the youngest ages where even 
the most massive stars did not yet end their nu- 
clear burning lifetime—so there is also meaning 
to age distributions constructed as was done in 
our work. 

We have tested our statistical analysis with 
mock data. To this end, we sampled a stellar 
population of 1000 stars more massive than 
15M, for a given Salpeter high-mass IMF with 
slope y = —2.35 and a continuous SFH (constant 
star formation rate). In this way, we have ob- 
tained Gaussian distributions of the ages and 
masses of individual mock stars with lo uncer- 
tainties of 20% and 15% in age and mass, re- 
spectively. These uncertainties are characteristic 
of the age and mass uncertainties of stars in our 
sample of 30 Dor stars (2). We then used exactly 
the same analysis technique as in our original 
work to infer the IMF and SFH of the mock 
star sample. The results of this test are shown in 
Fig. 1 and demonstrate that our analysis method 
is able to reproduce the underlying SFH and 
IMF of the mock stars. For comparison, we show 
the distribution of initial masses for an IMF with 
slope y = -1.90 to illustrate that our analysis 
technique can distinguish between a Salpeter 
IMF slope of y = -2.35 and a shallower slope 
of y = -1.90. This test further shows that both 
IMFs reproduce the mock data similarly well in 
the mass range 15 to 30M, and that the high- 
mass end (>30M,) of the distribution of mock 
masses has the largest power to constrain the 
high-mass IMF slope (Fig. 1C). 

Our analysis of the VFTS data relies on two 
different techniques to infer the high-mass end 
of the IMF: (i) by fitting the observed distribu- 
tion of stars in the mass range 15 to 200M. and 
(ii) by fitting the number of stars more massive 
than 30 and 60M. Both procedures give results 
that are in good agreement (2). From the in- 
ferred masses and corresponding uncertainties 
of our sample stars, we find 75.9$% stars above 
30Mz and 22.243 stars above 60Mz (2). Con- 
trary to what Farr and Mandel write in their 
reanalysis, their online data (https://github.com/ 
farr/30DorIMF, as accessed on 6 May 2018) sug- 
gest that their best-fitting SFH and IMF models 
underpredict the observed number of massive 
stars. They predict on average ~65 stars above 
30M, and ~18 stars above 60M. Their ratio of 
the number of stars greater than 30M, to the 
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number of stars greater than 60M, (~3.6) is 
larger than what we have observed in 30 Dor 
(~3.4), which appears to be consistent with Farr 
and Mandel inferring slightly steeper IMF slopes 
than we did in our analysis. Indeed, using our 
SFH model and the results of our fitting method 
(ii), the numbers of massive stars above 30M5 
and 60M, as predicted by Farr and Mandel are 
found for an IMF slope of about y = -2.10 [figure 
2 in our original work (2)]. This is consistent 
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Mini / Mo 


with their best-fitting IMF slopes of y = -2.05 to 
-2.15 for the different SFH models. 

The reanalysis of Farr and Mandel gives sys- 
tematically steeper IMF slopes than in our work 
and consequently seems to underpredict the 
observed number of massive stars in 30 Dor. We 
do not know the cause of this discrepancy. Our 
methodology appears to be robust, and the only 
other obvious difference in the two approaches— 
besides the statistical framework—is the assump- 
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tion on the SFH. We directly infer the SFH from 
the data without making assumptions on its func- 
tional form. Farr and Mandel assume Gaussian 
and exponential SFH models that provide more 
degrees of freedom than in our case, and find IMF 
slope differences of Ay ~ 0.1 depending on the 
assumed SFH model. This is a systematic un- 
certainty that we did not discuss in our original 
work, and that makes the inference of the IMF 
of composite stellar populations even more 
challenging. 
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MODELS 


By Michael Funk, Colin Norman’, Keith T. Smith, 
Jelena Stajic, and Jake Yeston 


cientists learn about the world by observing, manipulating, 
measuring, and abstracting. To make sure that they truly 
understand their system, and to gain insight beyond what 
experimental data can provide, many also turn to building 
mathematical models. Some models are based directly 

on fundamental physical laws, but most rely on approxima- 
tions. The computational costs vary widely—from exactly 
solvable models to those that require all the computer 
power you can get. This power has become increasingly 
accessible; throw in the recent forays of artificial intelligence 

into scientific computing and the tantalizing promise of quantum 
computers, and it’s clear that exciting times are ahead. 

this special issue, we highlight several areas of scientific com- 

r that have seen recent progress. A feature article by Science 

r Paul Voosen focuses on an ambitious and controversial proj- 
cked by high-tech philanthropists, to leverage breakthroughs 
ificial intelligence, satellite imaging, and high-resolution simu- 
ations to create a new climate model. Kent and Kotliar reflect on 
two complementary approaches to predicting the properties of mate- 
rials in which electrons are inextricably correlated with one another. 
Bottaro and Lindorff-Larsen report on the simulation of biological 
molecules, where recent progress has been driven by advances in the- 
ory and methods and tighter integration of experimental constraints. 
Sanchez-Lengeling and Aspuru-Guzik review approaches that use 
machine learning to design chemical structures that bear a desired 
characteristic. Finally, Brigmann describes how to simulate the com- 


The availability 
of computational plex equations that govern four-dimensional spacetime in Einstein’s 


resources enables theory of general relativity, focusing on the example of gravitational 


the simulation of 
increasingly intricate 
models in many 


waves produced by merging black holes. 


fields of science. *Colin Norman is a former Science News Editor. 
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ometimes it seems the clouds over 
climate science just won’t lift. Com- 
puter models of Earth’s climate 
have multiplied in number, com- 
plexity, and computational power, 
yet they remain unable to answer 
more precisely some of the ques- 
tions most on the public’s mind: 
How high must we build sea walls 
to last until 2100? How bad will heat waves 
get in the next decade? What will Arctic 
shipping routes look like in 2030? Climate 
models all agree that global temperatures 
will continue to rise in response to human- 
ity’s greenhouse gas emissions, but 
uncertainties stubbornly persist over 
how quickly that will happen and how 
high temperatures will go. 

Tapio Schneider, a German-born 
climate dynamicist at the California 
Institute of Technology (Caltech) in 
Pasadena, believes climate science 
can do better. And he’s not alone. 
Later this summer, an academic con- 
sortium led by Schneider and backed 
by prominent technology philanthro- 
pists, including Microsoft co-founder 
Paul Allen, will launch an ambitious 
project to create a new climate model. 
Taking advantage of breakthroughs 
in artificial intelligence (AD, satellite 
imaging, and high-resolution simula- 
tion, that as-yet-unnamed model—the 
Earth Machine is one candidate—aims 
to change how climate models render 
small-scale phenomena such as sea ice 
and cloud formation that have long 
bedeviled efforts to forecast climate. 
A focus will be on the major source 
of uncertainty in current models: the 
decks of stratocumulus clouds that 
form off coastlines and populate the 
trade winds. A shift in their extent 
by just a few percentage points could 
turn the global thermostat up or 
down by a couple of degrees or more 
within this century—and current 
models can’t predict which way they will go. 

Within 5 years, the team hopes its AI- 
fortified model will drive out that un- 
certainty and others by learning on its own 
how clouds behave, from both actual obser- 
vations and purpose-built cloud simulations. 
It’s a lofty goal, Schneider admitted late one 
May afternoon in sun-soaked Pasadena, sit- 
ting outside with his newly assembled team. 
They had just wrapped up a workshop, the 
third he had convened in the past year, bring- 
ing together leading climate scientists and 
engineers to discuss the future of their field. 
“We're under no illusions,’ Schneider said. 
“This is not going to be a cakewalk” 

There are reasons for skepticism. The 
United States already has many climate 
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models, and some people question why it 
needs another, further dividing resources. 
Others question the technology and won- 
der whether the philanthropists backing 
the project have given it the scrutiny that 
an agency such as the National Science 
Foundation would provide. The team’s un- 
orthodox message and means won’t make 
it easy to win people over, says David 
Randall, a climatologist at Colorado State 
University in Fort Collins. “I think the exist- 
ing modeling centers will push back. If Tapio 
is getting funding, that in principle could 
have gone to someone else.” 


Tapio Schneider hopes his planned global climate model will reduce 
uncertainties over how high and how fast temperatures will rise. 


CLIMATE MODELERS have always followed 
two imperatives. First, they’ve folded ever 
more features of Earth into their simulations. 
Models once contained only the atmosphere 
and ocean; now, they have subroutines for 
ice sheets, land use, and the biosphere. 
Second, they've sought higher and higher 
resolutions—modeling —_ interactions on 
smaller and smaller scales—riding the wave 
of Moore’s law on government-owned super- 
computers. By one estimate, the computing 
power those models use has increased by a 
factor of 100 million since the 1970s. As the 
models grew increasingly complex, they more 
fully reflected the vagaries of our planet— 
unknown unknowns turned to known un- 
knowns. Yet the uncertainties remained. 
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At their most basic, all the models work 
the same way: They take the globe and 
chop it into a mesh, with cells some 25 kilo- 
meters to 50 kilometers on a side, and use a 
set of code called a dynamical core to simu- 
late the behavior of the atmosphere and 
ocean over years and centuries. But much of 
what happens on the planet—cloud forma- 
tion, for example—arises at scales smaller 
than those grids. Therefore, those phe- 
nomena have to be described indirectly— 
“parameterized” in the jargon of climate 
science—with rule-of-thumb equations. The 
modelers then adjust those various knobs to 
best represent the world as they know 
it—a process called tuning. “It’s a mix 
of intuition and empiricism and some 
physically observed laws,” says Isaac 
Held, Schneider’s mentor and a scien- 
tist at the Geophysical Fluid Dynam- 
ics Laboratory, a prominent modeling 
center in Princeton, New Jersey. 

Make no mistake: Current models 
do an admirable job of re-creating the 
world. But their shortcomings drive 
scientists bonkers. They struggle to 
re-create Arctic temperatures and 
melting sea ice. Their distribution of 
rainfall is off, biased against the ex- 
treme torrents that can cause flood- 
ing. “The rain is falling in the wrong 
place and at the wrong rate,’ says 
Paul O’Gorman, an atmospheric sci- 
entist at the Massachusetts Institute 
of Technology (MIT) in Cambridge, 
who formerly worked with Schneider. 
And, especially important, the mod- 
els often fail to simulate those thick 
stratocumulus clouds, which _ typi- 
cally form off the coasts of the western 
Americas and help cool the region. 

Schneider, 46, has not always been 
fixated on clouds. Early in his career at 
Caltech, he focused on large-scale atmo- 
spheric flows, such as the Hadley cell. 
That atmospheric conveyor belt shifts 
air from the equator to the subtropics— 
the type of pattern that climate models can 
simulate using simple laws of physics. But 
while on an appointment at ETH Zurich in 
Switzerland, he became increasingly con- 
vinced that climate models could do a better 
job integrating new data on cloud behavior. 
He returned to Caltech in 2016 to seek a so- 
lution, adding a joint appointment at NASA’s 
Jet Propulsion Laboratory (JPL) in Pasadena, 
where he had become a close collaborator 
with one of JPL’s cloud gurus, Joao Teixeira. 

That was the start of what is now a collabo- 
ration of about two dozen people. AI, particu- 
larly a variant called machine learning, was 
on the upswing, and Schneider and Teixeira 
mused that it might help with the cloud prob- 
lem. Soon they recruited Andrew Stuart, a 
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soft-spoken computational mathematician at 
Caltech. The team found additional recruits 
at JPL, which has a vast archive of satellite 
data on clouds, and across the country at 
MIT, where researchers had built an ocean 
model infused with every possible satellite 
and buoy measurement of the seas. 

The MIT group had ambitions to go bigger, 
and its members welcomed Schneider’s over- 
ture. “Always the idea was to go to an Earth 
system model,” says MIT physical oceano- 
grapher Raffaele Ferrari. “But the atmo- 
spheric community wasn’t particularly 
primed to think the same way.” 

At first, the nascent collaboration was 
not set on creating a new climate model; 
the United States already has six prominent 


His core, the Non-hydrostatic Unified Model 
of the Atmosphere, is designed from the 
ground up for modern parallel computing. 
The core is also flexible and self-contained. 
It can solve equations to various degrees of 
accuracy in the same model, which should 
allow the Earth Machine to give a low-resolu- 
tion overview of the planet while zooming in 
on clouds in real time. 


CRUCIAL INPUT to the new model will come 
from simulations that have recently painted 
a much sharper picture of the low clouds and 
how they behave. Called large eddy simula- 
tions (LESs), those models trade the global 
scale and centuries-long time horizons of a 
climate model for narrow scope and high 


Learning the climate 


Anew data-driven climate model will use satellite observations and high-resolution simulations 
to learn how best to render its clouds. Similar methods will also be applied to other, small-scale phenomena, 


such as sea ice and ocean eddies. 


Global climate 
model 


High-resolution cloud models will 
run as cells within the climate model, 


In orbit since 2006, the 
Cloud-Aerosol Lidar 
and Infrared Pathfinder 
Satellite Observation 
spacecraft uses lasers 
to peer into clouds. 


Earth observation 


The model will use 
artificial intelligence (Al) 
to learn from averaged 
weather data and, 
eventually, direct satellite 
observations. 


Al 


Cloud simulations 


guiding its global simulation. 


models. “It was more a question of how can 
we build a better model,’ Schneider says. 
But they wanted to be certain that a full 
climate model would incorporate their in- 
novations. They decided the best way would 
be to build a new model, albeit one start- 
ing with existing code. Doing so meant they 
needed a computation whiz who could take 
their equations and make them run on a 
next-generation supercomputer. 

AUS. Navy expert reported for duty. Frank 
Giraldo, an applied mathematician at the Na- 
val Postgraduate School in Monterey, Califor- 
nia, is behind the Navy’s new dynamical core, 
the mathematical engine at the heart of its 
next-generation weather and climate models. 
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exacerbate warming. First, higher temper- 
atures appeared to allow more dry air to 
penetrate thin clouds from above, prevent- 
ing them from thickening and reflecting 
more of the sun’s energy. Second, increased 
carbon dioxide (CO,) levels trap heat near 
the clouds’ tops, preventing their cooling. 
Because such cooling drives the turbulence 
that forms clouds, the effect could impede 
cloud formation, fueling further warm- 
ing. If CO, emissions continue unabated, 
Bretherton says, “It’s possible that most of our 
low clouds in the tropics would melt away.” 

Other evidence, including actual cloud 
observations, also suggests “that the low- 
cloud feedback is positive and that low 
clouds will amplify climate warming,” says 
Stephen Klein, an atmospheric scientist at 
Lawrence Livermore National Laboratory 
in California. 

Those breakthroughs have not yet made 
their way into global models because no 
bridge, or technical way to get them there, 
has emerged. But Schneider’s team is build- 
ing one: an LES that can simulate cloud 
behavior over days within a domain of up 
to 100 kilometers on a side—about the size 
of one cell in a climate model. Their LES 
is based on a Caltech-developed model 
called the Python Cloud Large Eddy Simu- 
lation (PyCLES) that focuses on low clouds. 
“These simulations may not be perfect,” 
Schneider says, “but theyre much, much 
better than anything else we have.” If all 
goes according to plan, Giraldo’s code will 
run 1000 PyCLES-type models on the fly as 
individual cells inside the Earth Machine. 
The machine will also use AI to study the 
observed and simulated clouds, extrapolat- 
ing what it learns to improve the rules of 
thumb it uses to simulate clouds across 
the globe. Soon, virtual cloud decks 


resolution. The 
models re-create several days in 

the life of small parcels of the atmosphere, 
with cells only 10 meters on a side. At such 
resolutions, key aspects of cloud formation— 
such as the convection that lofts sun-heated 
air upward until the water vapor it carries 
condenses into clouds—arise directly from 
physical laws. The results sometimes closely 
resemble reality, says Chris Bretherton, a 
leading cloud scientist at the University of 
Washington in Seattle. 

Several years back, Bretherton led a proj- 
ect that used LESs to study how a 2°C tem- 
perature rise affected low ocean clouds. Two 
feedbacks emerged, both of which would 
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will sprout off the California coast. 


INCORPORATING Al into climate mod- 
eling is a work in progress. Several 
researchers, including Bretherton’s 
group and Michael Pritchard, a cli- 
mate modeler at the University of Cali- 
fornia, Irvine, trained one form of AI, 
neural networks, on high-resolution simu- 
lations of the atmosphere. They then used 
the AI to replace several classic rules of 
thumb, such as how quickly the tempera- 
ture and humidity change in rising air. “All 
of these are in the feeling-around type of 
phase,” Bretherton says. 

But neural networks and climate are an 
uneasy fit. The algorithms do best on prob- 
lems such as classification—for example, 
learning from millions of labeled photos 
what a dog looks like. The code builds 
up an intricate model for what an object 
looks like that is often wholly inexplicable 
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to human reasoning. The approach works 
for dogs—but may break down when it 
encounters something outside its training 
data—say, a camel. And for climate change, 
the future is a camel. For that reason, 
Stuart and Schneider are not banking on neu- 
ral networks to guide the Earth Machine’s AI. 
Instead, they seek a compromise, something 
between traditional rules of thumb and pure 
AI. They hope to develop code that can use 
hard-won knowledge of clouds and then fill 
in gaps with its learned intuition, essentially 
replacing the manual tuning typically done 
by modelers with a machine. 

That learning won’t be driven by indi- 
vidual cloud systems, which are imprinted 
with the atmospheric chaos that begat them. 
Rather, the AI will learn from seasonal or an- 
nual statistics on cloud coverage and other 
factors, wiping out the noise of weather. 
As Stuart and Schneider move each rule of 
thumb over to the Al’s hands, they'll wire the 
model to calculate probabilities, allowing an 
overall reckoning of uncertainty not yet seen 
in current climate models. And, partly at 
the prompting of their engineering-minded 
funders, they'll develop metrics to gauge 
how accurately the model renders the world. 
Theyre betting on recent insights that, for 
some aspects of the climate system, short- 
term accuracy in a model indicates decades- 
long viability. 

Success is far from guaranteed, the team 
agreed after the May workshop. “It could 
be that what we do ends up not improving 
the numbers, just to be completely scien- 
tifically honest,’ Stuart said. But even so, 
he added, the approach should spark new 
ideas across climate modeling. “Id say 
that’s the worst-case scenario,” Schneider 
quickly interjected. “This is why I say we 
can’t fail entirely. But I do hope we will do 
more than that.” 


AS SCHNEIDER ASSEMBLED his team and de- 
veloped a general plan, he still faced a big 
question: Who would support their dream? 
That the U.S. government would finance 
yet another climate model seemed unlikely. 
Even before President Donald Trump’s White 
House proposed cuts in climate science, 
former President Barack Obama’s admin- 
istration had explored whether the country 
needed to support so many models. 
Fortunately for the Caltech team, the 
tech philanthropists—particularly Allen, 
who has already invested heavily in 
oceanography—were looking for something 
to make a splash. They sought a risky invest- 
ment with a big potential payoff that could 
make climate forecasting more concrete. 
Schneider already had preliminary support 
for the Earth Machine from Charles Trimble, 
a Caltech alumnus who miniaturized the 
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GPS receiver, and the Heising-Simons Foun- 
dation in Los Altos, California. But to reach 
their full ambitions, they needed more, some 
$5 million annually—a goal that now seems 
in sight, though the exact financing was still 
being finalized at press time. 

The mix of ambition, metrics, and innova- 
tion embodied in the Earth Machine was ex- 
actly the type of work that Allen wants to fund, 
says Chris Emura, a Seattle, Washington- 
based computer engineer 
leading Allen’s engagement 
with Schneider. Over the past 
year, Allen’s team has been 
enmeshed in the modeling 
world, visiting leading centers 
to gauge what they can and 
can’t do. Schneider’s project, 
Emura says, has some audac- 
ity to it, with a high degree of 
“responsible risk.” The team 
also has garnered interest 
from the Windward Fund, a 
conservation charity in Wash- 
ington, D.C., that began an 
effort last month to support 
work that improves near-term 
practical climate predictions. 
And _ Schneider’s endeavor 
has won praise from Schmidt 
Futures in New York City, the 
science-focused philanthropy 
of former Google CEO Eric 
Schmidt and Wendy Schmidt, 
president of the Schmidt Fam- 
ily Foundation in Palo Alto, 
California. “It’s an attractive ICE 
blend of conservative and 
bold approaches,” says Stuart 
Feldman, the philanthropy’s 
chief scientist. 

As rumors of the Earth Ma- 
chine have spread, the project 
has drawn a mix of support, 
envy, and skepticism. A new 
approach like that is des- 
perately needed, says Trude 
Storelvmo, an atmospheric 
scientist at the University of 
Oslo. “This is a very welcome 
and innovative idea.” She adds 
that it could bolster the case for expanded 
observations of clouds—necessary because 
NASA’ current cloud satellites have worked 
nearly a decade longer than planned. 

In contrast, Amy Clement, a cloud scien- 
tist at the University of Miami in Florida, 
laments the focus on building more com- 
plex models. “As a result, in my opinion, 
we are losing a lot of our ability to gain 
fundamental understanding of the cli- 
mate system.’ However, she adds, given 
Schneider’s acumen as a climate scientist, 
the model might lead to such understanding. 


STORMS 


RAIN 
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Key uncertainties 
Although climate models 
provide a fairly faithful 
rendering of the world, 
computational limits lead 
to persistent errors that 
have proved difficult to fix. 
Low stratocumulus clouds 
are a big problem. Some 
others include: 


A band of thunderstorms 
encircles the globe near 
the equator, shifting north 
or south depending on 
the season. Models have 
struggled to represent the 
intertropical convergence 
zone, as it’s known, 

often producing two 
bands, one on each side 
of the equator. 


Current models vary 
widely in their Arctic 
temperatures compared 
with reality, leading to 
large differences in their 
simulated sea ice extent. 


Models tend to simulate 
more light rain than seen in 
reality and underestimate 
the frequency of severe 
downpours and floods. 


Bretherton, meanwhile, likes the group’s am- 
bitions but questions whether a new model is 
needed to realize them. “We already have too 
many climate models in the United States,’ 
he says. “It divides our resources and makes 
scientific progress slower.” 

Other people think the project is discount- 
ing rewards that will come when existing 
models are pushed to run globally at higher 
resolutions. Much of the climate science com- 
munity in Europe, for exam- 
ple, is invested in a proposal 
called Extreme Earth, which 
would push models to a reso- 
lution of 1 kilometer per cell. 
Although such code would 
require a network of super- 
computers and wouldn’t run 
as long as traditional models, 
it would also eliminate many 
parameters that Schneider is 
seeking to improve with AI, 
replacing them with physics. 
“Tm so frustrated with the 
idea of parameterizing these 
things,’ says Bjorn Stevens, a 
climate scientist at the Max 
Planck Institute for Meteoro- 
logy in Hamburg, Germany. 
“What I find more exciting 
is getting rid of those rules 
of thumb.” 

There’s also a big assump- 
tion baked into the Earth 
Machine: that the cloud 
problem can even be solved, 
adds Joel Norris, a cloud 
scientist at the Scripps In- 
stitution of Oceanography 
in San Diego, California. 
Perhaps any sort of param- 
eterization, even one tuned 
by AI, cannot crack clouds 
to a meaningful degree. “It 
may be the case you can’t re- 
duce the uncertainty,’ Norris 
says. Some satellite observa- 
tions essential to rendering 
clouds, such as the exact 
location of water vapor in 
the lower atmosphere, sim- 
ply don’t exist. And Schneider’s team 
could be shocked when it sees how appar- 
ently unconnected parts of the model go 
awry when clouds are tweaked, Held adds. 
“There’s just a lot of connections.” 

Schneider’s team is aware of all those 
concerns and shares many of them. But the 
members are ambitious and have grown im- 
patient waiting for a breakthrough. They’ve 
lived with human-driven climate change, 
and its dogged uncertainties, as a reality 
for their entire adult lives. It’s time for the 
clouds to lift. 
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REVIEW 


Toward a predictive theory 
of correlated materials 


Paul R. C. Kent’ and Gabriel Kotliar”’** 


Correlated electron materials display a rich variety of notable properties ranging from 
unconventional superconductivity to metal-insulator transitions. These properties are of 
interest from the point of view of applications but are hard to treat theoretically, as 

they result from multiple competing energy scales. Although possible in more weakly 
correlated materials, theoretical design and spectroscopy of strongly correlated electron 
materials have been a difficult challenge for many years. By treating all the relevant energy 
scales with sufficient accuracy, complementary advances in Green’s functions and 
quantum Monte Carlo methods open a path to first-principles computational property 


predictions in this class of materials. 


redicting materials properties, starting 

from first principles, involves solving the 

quantum many-body problem in real ma- 

terials. This has been a long-standing goal 

in theoretical and computational physics, 
chemistry, and materials science. The challenge 
was laid out by Paul Dirac in his famous 1929 
quotation, “The underlying physical laws neces- 
sary for the mathematical theory of ... the whole 
of chemistry are thus completely known, and 
the difficulty is only that the exact application 
of these laws leads to equations much too com- 
plicated to be soluble,” which was followed by a 
call for methodological developments: “Approx- 
imate practical methods of applying quantum 
mechanics should be developed, which can lead 
to an explanation of the main features of complex 
atomic systems without too much computation” 


(1), a task which has kept researchers occupied 
for almost a century. 

Two great advances toward meeting this goal 
were recognized by the 1998 Nobel Prize in 
Chemistry, awarded to John Pople and Walter 
Kohn. The first centered on constructing accu- 
rate approximations to the electronic wave func- 
tions from which one can construct directly all 
the observables of interest. The second focused 
on building functionals of electron density, which 
when minimized yield the physical density and 
the total energy of the ground state (7). Extensions 
of these methods have been successful in a large 
number of cases, but some long-standing diffi- 
culties remain. The wave function-based tech- 
niques, such as configuration interaction methods, 
have a computational cost that increases rapidly 
with system size. This makes it difficult to apply 
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Fig. 1. Green’s function diagrammatic schemes. (Left) Hedin’s diagram- 
matic scheme for the free energy ®, the self-energy £, and the polarization 
II. Calculating only the first diagram in the expansion yields the self- 
consistent GW (scGW) approximation. Inclusion of the second diagram is 
denoted by GWI;. Other approximations, such as the quasiparticle self- 
consistent GW (QSGW) approximation, which is not fully self-consistent, or 
scheme D, which resumes the second-order vertex via the Bethe-Salpeter 
equation, work very well because of the cancellation of errors. (Right) 
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these methods to solids, which owe their emer- 
gent properties to their very large number of 
particles. Although density functional theory 
(DFT) is formally exact, improving the approx- 
imate functionals used in practice is difficult (2). 
The long-established and standard implemen- 
tations of the density functional methods, such 
as the local-density approximation (LDA) or gen- 
eralized gradient approximation (GGA), can treat 
the energetics of systems in the thermodynamic 
limit, but they do not capture the atomic char- 
acter of the excitations in the so-called strongly 
correlated electron systems. A hallmark of strongly 
correlated materials is the presence of partially 
filled d or f electronic shells that are energeti- 
cally situated to participate strongly in deter- 
mining the material’s electronic, magnetic, or 
structural properties. The strong and sensitive 
relationship between these properties requires 
them to be self-consistently determined, poten- 
tially amplifying any small errors or approxima- 
tions made. Today, the standard approximations 
to DFT are not sufficiently robust, necessitating 
complementary approaches in these materials. 
Substantial progress in methods development 
for strongly correlated electron systems has taken 
place over the last three decades, owing to the 
introduction of new concepts such as quantum 
embedding, novel algorithms, and the exponen- 
tial increase in computational power. The solu- 
tion of the quantum many-body problem has 
progressed along two complementary paths: 
the quantum Monte Carlo (QMC)- and Green’s 
function-based methods. In the former, one fo- 
cuses on describing the full wave function or 
density matrix of a given material, with the 
use of statistical methods to treat the large 
number of variables. In the latter, one computes 


ZnS 
ZnSe Cc 


BN NaCl 


MgO LiF 


Relative deviation of calculated semiconductor gaps from the experimental 
band gaps (excluding the electron-phonon renormalization). Different 
Green's function—based approximations were employed: QSGW, scGW, GW 
T, (scheme B), and scheme D. The experimental data (Exp) and the 
experimental data corrected by the effects of electron-phonon interactions 
(Exp - E-ph) are shown. These results are based on the approximations 
described in reference (12). Adapted with permission from (12), copyright 
(2017) by the American Physical Society. 
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Green’s function, a function of a smaller 
number of variables, from which one can 
extract observable information. Despite the 
different paths, these methods have now 
progressed sufficiently to inform each other. 
In this brief review article, we discuss these 
approaches critically, together with recent 
notable achievements. 


Green’s function approach 


The quantum-mechanical Hamiltonian 
is an operator that represents the energy 
and governs the dynamics of the electrons 
in a solid. It can be written down explic- 
itly; however, the evaluation of its ground- 
state wave function is impractical because 
it is a function of a very large number of 
coordinates (on the order of 107? in a 
typical-size crystal) that cannot be realisti- 
cally stored or computed. The Green’s 
function approach shares the spirit of the 
DFT, as it formulates the exact solution 
of the quantum many-body problem in 
terms of a function, G, of a small number 
of variables. Furthermore, the approach 
works in the thermodynamic limit of a 
very large number of electrons. An advan- 
tage of this group of methods is that the 
form of the functional expressing the free 
energy in terms of G and the Coulomb 
interaction V, is known explicitly as asum 
of an infinite set of Feynman diagrams, 
which is a visual way to represent the 
physical processes of interacting electrons 
in a solid. The functional was first written 
down by Luttinger and Ward (3) and Baym 
and Kadanoff (4); its truncation gives rise 
to various approximate solutions to the 
quantum many-body problem (5). 

Green’s function G(w) is an operator 
whose matrix elements (7'|G(@)|r) = 
G(r,r', @) are defined as the Fourier trans- 
form with respect to time of the quantum- 
mechanical amplitude for an electron to 
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Fig. 2. The multidimensional computational complexity 
of Green’s function approaches. Green's function calcula- 
tions have to be converged along multiple axes to reach 
the full solution of the quantum many-body problem. 

The z axis denotes the order of Hedin’s systematic PT in the 
screened Coulomb interaction W (blue dashed lines). The 
first-order PT leads to the GW approximation. The next 
perturbation order, GWIj, has only recently been computed 
(12). Other approaches, such as the those exploiting the 
DFT Kohn-Sham Green function or the QSGW methods, 
are indicated with dashed green lines. An alternative 
approach is to sum all local diagrams to infinite order in 
PT, as done within the single-site DMFT. This local approxi- 
mation can be improved through cluster extensions, as 
indicated schematically on the y axis (labeled by locality), 
which parametrizes the level of locality of the diagrams. 
Unlike model Hamiltonian studies, realistic electronic 
structure calculations involve numerous discretizations and 
many other parameters that need to be converged, such as 
the size of the basis set, the number of k points, and the 
dimension of the space of correlated orbitals. We represent 
the reduction of these discretization errors as increasing 
along the x axis. Future advances will involve different 
choices of locality and discretization errors at different 
orders in the PT. We represent this path as an arrow in the 
three-dimensional space. Adapted from (94). 


an accurate evaluation for use with em- 
bedding methods is an open problem and 
an active area of research (8-10). 

The pioneering work of Hedin (1D) out- 
lined a systematic perturbation expansion 
in G and W, which is sketched in the left 
panel of Fig. 1. The lowest-order term for 
the self-energy gives rise to the so-called 
GW approximation. The full self-consistent 
evaluation of the next term has been 
achieved only recently (72). In principle, 
higher-order corrections are computable 
numerically. If for a given material the 
series is asymptotic, one can hope to ob- 
tain very accurate answers. In spite of the 
appeal of a systematic expansion, it turns 
out that for certain physical quantities it 
is better to avoid the self-consistency or 
achieve only partial self-consistency to 
obtain better agreement between theory 
and experiment (Fig. 1). Better agreement 
between the calculated and experimental 
band gaps of semiconductors can be ob- 
tained by using the DFT Kohn-Sham Green 
functions Gxs in the evaluation of the self- 
energy and the polarization (the one-shot 
or GoW method) (13) or a quasiparticle 
self-consistent Green function defined on 
the real (14) or imaginary (15) time axis. 
It is also sometimes necessary to carry out 
partial summations of diagrams to satisfy 
conservation laws. Therefore, implement- 
ing this formalism continues to be an art 
as well as a science; finding an appropri- 
ate path in the space described in Fig. 2 is 
important for materials at the edge of the 
research frontier, as it is not possible to 
check convergence simultaneously along 
the three axes of Fig. 2. With an increase 
in computational power and improved al- 
gorithms, we can envision a future where 
this convergence can be routinely checked 
systematically. 

To understand the success of this per- 


propagate from point 7 to point r’. It is expressed 
in terms of another operator, the self-energy X(a). 
G(7,r',@) 
I 


~ @+V2 +n — Vu — Va — 2(0) ) 


Here, Vy and V..ys¢ are operators representing 
the Hartree and crystalline potentials, respec- 
tively, and u represents the chemical potential. 
Important concepts in this approach are the 
screened interaction in the solid, denoted by 
W, which represents the Coulomb interaction 
corrected by the effect of the motion of the elec- 
trons in the solid through their density-density 
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response, W! = V.' — TI, where IT is the po- 
larization. A different concept in this field is the 
notion of a partially screened local Coulomb inter- 
action, or Hubbard tensor Uspcq, Which is the matrix 
element of a partially screened Coulomb inter- 
action in a set of atomic centered orbitals Xppi (7"). 

The diagonal part of this tensor is called the 
Hubbard parameter U and describes the ener- 
getic cost of two electrons residing in the same 
atomic orbital, due to their average Coulomb 
repulsion. It first appeared in the Hubbard mod- 
el, a simplified model that captures the physics 
of the competition between the kinetic energy, 
which delocalizes the electrons, and the Coulomb 
repulsion, which favors their spatial localization. 
The remaining parts contain the physics of the 
Hund’s rule coupling J, which originates from the 
intra-atomic exchange interaction between elec- 
trons in different orbitals; J reduces the Coulomb 
repulsion energy when one has a configuration of 
electrons with parallel spins. Various methods to 
estimate the Hubbard tensor have been proposed 
(6, 7), but a precise definition that lends itself to 


turbative approach, it is useful to return to the 
Fermi liquid theory, which justifies our think- 
ing of the excitations of a solid in terms of 
quasiparticles. In many cases, the Kohn-Sham 
DFT is a good starting point for computing the 
quasiparticle properties in perturbation theory 
(PT) in the screened Coulomb interactions, and 
the lowest order in perturbation around the Kohn- 
Sham DFT is close to the experimental answers. 
On the other hand, there are materials where 
the effective Coulomb interaction is too large for 
this approach to be meaningful, as the excitations 
are better described in terms of atomic properties 
(such as local magnetic moments). A classic exam- 
ple are paramagnetic Mott insulators, which ex- 
hibit Curie-Weiss susceptibilities rather than 
Pauli paramagnetism. Other examples are mixed- 
valence and Kondo physics materials and uncon- 
ventional superconductors. In these cases, a 
summation to all orders in PT is mandatory 
to obtain meaningful results. 

A breakthrough in this area was the realiza- 
tion that this summation can be formulated and 
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carried out nonperturbatively in terms of the so- 
lution of a self-consistent quantum impurity mod- 
el, provided that one restricts the self-energy to 
be local. This defines the dynamical mean-field 
theory (DMFT) (16) approach. Originally formu- 
lated for simple model Hamiltonians where the 
summation becomes exact in the limit of large 
lattice coordination (17), DMFT was later incor- 
porated into electronic structure methods, starting 
with the LDA+DMFT approach (18, 19). 

The LDA+DMFT approach introduces the no- 
tion of locality for the self-energy = by defining a 
basis of local orbitals x, ait (r’), which are centered 


at sites defined by R’. One then divides all 


orbitals into two sets. For the uncorrelated 
orbitals, the self-energy is approximated by the 
exchange correlation potential of DFT, whereas 
the self-energy of the correlated orbitals is eval- 
uated by an impurity model [as Zjmp(@)] in a 
self-consistent medium, as illustrated in Fig. 3, 


LF, r',o) = 8(r — 7") Veer) + 
Sx gF) Emo) — Ete) ai ga %aP) (2) 
aR,B 


where x* indicates the complex correlate of x, 
o.and B correspond to orbital indexes, and 6 is 
the delta function. Eg. accounts for the con- 
tribution to the self-energy that is included in 
the exchange correlation potential V, (20). 
Single-site DMFT is just one example of a 
quantum embedding scheme. The central idea 
of this approach is a judicious use of two steps, 
truncation and embedding. In the truncation 
scheme, one selects some orbitals to be treated 
at the DMFT level, whereas the rest of the sys- 
tem is replaced by a bath that enables fluc- 
tuations among different configurations. In the 
language of quantum chemistry, rather than 
focusing on configurations with a fixed number 


of electrons, the quantum embedding allows 
fluctuations among different configurations. 
This was a major step forward relative to more 
drastic truncations that describe a solid as a 
finite isolated cluster of atoms. 
In DMFT one uses the system 
and bath to compute a local 
self-energy. For this purpose 
there have been notable ad- 
vances in the capabilities for 
solving quantum impurity 
models, such as the intro- 
duction of continuous-time 
QMC methods (21, 22). Further algorithmic 
advances will be needed to improve the effi- 
ciency of the sampling, to reach lower temper- 
atures, and to treat a larger number of correlated 
orbitals. Another difficulty is the minus sign 
problem, which appears in the solution of the 
impurity model with off-diagonal hybridiza- 
tion, which occurs in sites with low crystal 
symmetry. There is a need for impurity solvers 
working on the real axis to avoid carrying ana- 
lytic continuation of numerical data. Tailoring 
of Wilson renormalization group methods to 
the solution of self-consistent quantum impurity 
models has shown promising results (23), but 
major extensions are needed for tackling elec- 
tronic structure problems. After the truncation 
step, the self-energy is embedded in a lattice 
Green function or the two-particle irreducible 
vertex in a two-particle response to obtain 
momentum- and frequency-dependent physical 
quantities (Fig. 3). 

Other embedding methods exist that treat 
solid-state systems. The density matrix embed- 
ding theory (DMET), introduced by Knizia and 
Chan (24), was inspired by DMFT, but it gives 
rise to a simpler impurity model. The Gutzwiller 
approximation used in connection with LDA, 
called LDA+G (25, 26), is also computationally 


xai)*C | rr] |r] 


Ato 


Electron resevoir ef ey 


G(o) = 


Fig. 3. DMFT as a quantum embedding method. A quantum truncation of a periodic system is an 
impurity model, an atom or a cluster of atoms in a bath (reservoir). From the impurity model, one 
calculates irreducible quantities such as the self-energy = or the particle hole irreducible vertex I, which are 
then embedded in lattice correlation functions. These are used to calculate frequency and momentum 
observables, such as the spin susceptibility x or the one-particle Green function G. In realistic LDA+DMFT 
treatments of materials, a small subspace is correlated so that = is a small block, and the correlated 
self-energy is embedded in a large Hilbert space; thus, the lines in the equation for G represent the 
Kohn-Sham Green function from the DFT. The lines in the equation for x represent the full Green function G. 
q, momentum; V,, hybridization between the impurity and the bath of electrons. 
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“Green's functions 
and Monte Carlo 
methods are largely 
complementary.” 


less demanding than LDA+DMFT. The slave 
particle method is widely used for the study 
of model Hamiltonians and has been com- 
bined with LDA (27). The rotationally in- 
variant slave boson (RISB) 
form (28, 29) has been shown 
to be quite accurate for the 
estimation of total ener- 
gies. Recently, simple con- 
nections among all these 
approaches have been un- 
covered (30, 32), with LDA+G 
being equivalent to LDA+RISB 
and DMET appearing as a limiting case of 
RISB. For these simplified methods to reach 
first-principles status, a prescription for the 
determination of the Hubbard U tensor to be 
used is needed. 

Single-site DMFT can be extended by taking 
as a reference system a cluster of sites, rather 
than a single site (moving to the right on the 
locality axis of Fig. 2). There are different cluster 
extensions of single-site DMFT (5, 32) and many 
subtleties: for example, how to maintain causality 
(33). In practice, different materials require dif- 
ferent types of approximations; hence, many- 
body theory of correlated systems requires some 
judicious choices, guided by physics and chem- 
istry, deciding of the level of approximation for a 
given material in Fig. 2. For example, a material 
such as VO, which contains dimers in the unit 
cell, requires two-site DMFT (34), whereas in 
other systems such as V2Os, single-site DMFT 
is sufficient (35). 


Representative applications: 
Theoretical spectroscopy 


The goal of theoretical spectroscopy is to infer 
different correlation functions—for example, the 
one-electron correlation described in Eq. 1 or 
the correlation function between the electronic 
spins—with minimum or no input parameters. 
To extract this information experimentally, dif- 
ferent large-scale facilities with different experi- 
mental tools, such as angle-resolved photoemission 
spectroscopy or neutron scattering, respectively, 
are needed. Theoretical spectroscopy also pro- 
vides a framework to interpret the experimental 
results and advances the field by falsifying or 
verifying theories and indicating which aspects 
of the modeling need further attention. LDA 
+DMFT is now widely used in this area, with 
surprisingly good agreement with experimental 
data; for example, it has been used to explore 
the variation of the spin fluctuation spectra of 
the iron pnictides and chalcogenides (36, 37) 
and the heavy fermion systems (38). 

We provide a more detailed example of the 
power of this method in suggesting the solution 
of the mystery of Pu, a distinctive elemental solid. 
Pu metal is different from all other elements in 
the periodic table because it forms so many vastly 
different allotropes in its phase diagram in a 
narrow temperature range. Even its simplest face- 
centered cubic (fec) phase has remarkable trans- 
port and thermodynamical properties that have 
for many years defied qualitative understanding. 
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LDA+DMFT provided a comprehensive theory 
of this material. The anomalies in its properties 
can be traced to a mixed-valence condition, 
where the shell of electrons fluctuates between 
an f° and f° configuration with a characteristic 
mixed-valent scale that had been predicted to 
be about 80 meV (39). 

The DMFT valence histogram describing the 
quantum-mechanical probability for a given f 
configuration is shown in Fig. 4. LDA+DMFT 
angle-integrated photoemission spectroscopy 
identified fingerprints of this mixed-valent state 
(Fig. 4, A and B). LDA+DMFT-based neutron 
theoretical spectroscopy was used to predict the 
powder average neutron scattering spectra of 
6-Pu, as shown together with the experimental 
results in Fig. 4, D and E. 


Wave function methods 


To estimate the complete ground-state wave 
function in a solid, with its characteristically 
large number of electrons and its exponen- 
tial scaling in the number of orbitals, seems 
at first sight a task that is impossible to carry 
out to useful accuracy. Nevertheless, stochastic 
algorithms offer a more scalable alternative to 
deterministic methods, and a quarter century of 
materials applications underscores their practi- 
cality. Stochastic algorithms were initially used 
for the homogeneous electron gas and then for 
weakly correlated solids and molecules (40), but 
algorithmic breakthroughs in real-space QMC 
(41-43) and the more recent development of 
related methods such as auxiliary-field QMC 
(AFQMC) (44) and full-configuration interaction 
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more established real-space methods scale with 
the second to fourth power of system size, de- 
pending on the measured quantity. AFQMC has 
similar scaling and gives simpler access to prop- 
erties other than the energy but is generally 
believed to have a much larger cost prefactor. 
FCIQMC has exponential scaling, leaving a far 
smaller range of systems for which the method 
is applicable, but it is already near-exact in its 
current formulations. Although materials appli- 
cations today may require the use of millions of 
supercomputer hours, these QMC methods all 
have the potential to treat a wide range of strong- 
ly correlated materials and potentially demonstrate 
systematic convergence of all errors. Although 
methodological improvements are needed, in- 
creased computer power is steadily increasing 
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their range of application. 

The central idea of QMC methods is to avoid 
explicit numerical integration of the many-body 
Schrédinger equation and to recast the problem 
statistically via use of Monte Carlo integration 
and stochastic projection. This allows the methods 
to operate on thousands of electrons, compared 
with the tens or low hundreds reached by today’s 
most accurate deterministic algorithms, in ex- 
change for a statistical uncertainty. Importance 
sampling via an input trial wave function can be 


Fig. 4. Using LDA+DMFT to understand plutonium. (A) Experimental (95) and (B) theoretical (96) angle- 
integrated photoemission spectroscopy of 5-Pu. (A) was adapted with permission from (95), copyright (2000) by 
the American Physical Society. (B) was reproduced from (96). The arrows highlight two surprising features in 
addition to the narrow resonance and the broad Hubbard band, assigned to quasiparticle multiplets. For 
comparison, the experimental a phase is shown together with a compressed fcc phase. a.u., arbitrary units; DOS, 
density of states. (C) DMFT valence histogram of 5-Pu, reproduced from (96). The histogram displays the 
occupancies of two distinct valences, f° (blue bars) and f° (green bars), which are key to understanding the 
anomalous properties of Pu. The bars with the same color correspond to distinct values of total spin J of the 

Pu atom [for more details, see (96)]. (D) Experimental and (E) theoretical neutron spectroscopy on a powder 
sample. Reproduced from (97). F, magnetic form factor; Q, momentum transfer; is, Bohr magneton; f.u., formula 
units; ARCS, angular-range chopper spectrometry; 7, temperature; 4, Planck's constant h divided by 2r. 
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Fig. 5. Recent applications of QMC to strongly correlated materials. 
These applications were enabled by algorithmic advances and increased 
computational power. By using the atomic parameters and (A) the structure 
of the conventional unit cell of (Sr/Ca)2CuO3s, (B) total energies (shifted by 
Eo = 34,705 eV) for a ferromagnetic (FM) and an antiferromagnetic (AFM) 
state were evaluated to (C) extract the superexchange (J) by exploiting the 
variational principle and using trial wave functions from GGA+U. The best 
agreement with experimental data (shown in gold bands) is found at the 
lowest Eomc — Eo energies. Adapted from (49). (D) This method was also 


used to improve the statistical efficiency, with 
higher efficiencies obtained by physically more 
accurate trials. The avoidance or minimization 
of numerical integration also allows for the use 
of more general wave functions, allowing more 
physics to be built in. As the trial wave function 
is improved and approaches the exact result, 
physical accuracy is increased and the number 
of Monte Carlo samples required for a given sta- 
tistical accuracy is reduced, motivating develop- 
ments of better trial wave functions. 

The most fundamental challenge in QMC is the 
fermion sign problem: to sample an all-positive 
probability distribution derived from the wave 
function requires knowledge of where the wave 
function changes sign between positive and neg- 
ative. To achieve this and determine the exact 
wave function formally has an exponential com- 
plexity. All the different QMC methods therefore 
aim to obtain a sufficiently accurate result by 
either introducing controlled approximations 
or reducing the prefactor of the exponential 
sufficiently that interesting materials and prop- 
erties can be studied. 

To guide the discussion, we will use a very 
general form of the many-body wave function in 
the solid as given by 


M 


¥({R}) = S > civiDte") 


q=1 


(3) 


where {R} is the complete set of electron po- 
sitions {7}, D} and D} are Slater determinants 
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filled with various single-particle orbitals, and c; 
are expansion coefficients, multiplied by an op- 
tional Jastrow factor J that correlates electron 
positions. / is the number of electrons. The single- 
particle orbitals are usually obtained from DFT 
or quantum chemistry and the remaining com- 
ponents through numerical optimization. The trial 
wave function is therefore not exact and inherits 
some of the error made by the other methods. 
The most accurate and widely applied QMC 
method for materials is diffusion Monte Carlo 
(DMC). Within this method, a reformulation of 
the time-dependent Schrédinger equation is used 
to project out the ground-state wave function ®o 
from an input trial wave function ’y. The use of a 
projection algorithm minimizes dependence on 
inputs, but with the complication of computing 
the mixed-probability density |®o‘¥r|. The total 
energy is easily computed from this mixed dis- 
tribution, but this method is more difficult to 
use to compute properties that do not commute 
with the Hamiltonian. To ensure that the solu- 
tion has the correct symmetries of a fermionic 
state in DMC, the nodal surface of the solution 
is constrained to that of a trial wave function. 
This introduces a variational error in the com- 
puted energy, but this variational nature allows 
different options to be tested and the lowest- 
energy input to be rigorously selected. The chal- 
lenge of applying DMC in correlated materials 
has therefore been to find a source of trial wave 
functions with sufficiently accurate nodal sur- 
faces. Ideally these inputs would be obtained 


used in reference (50) to estimate phonon frequencies and gaps in isostructural 
LazCuO,; a correction of 0.5 eV was applied to the DMC values as the 

gap was evaluated at the I point. The agreement for gaps of correlated electron 
systems in fixed-node DMC (FN-DMC) was tested in several oxides, as also 
shown in (D). PBE, Perdew-Burke-Ernzerhof functional. (E) The comparison 
between calculated values within fixed-node DMC of the superexchange constant 
(J), gaps (Ag), and phonon frequencies (wgig), with their values obtained by 
several other methods and experiment. PBEO, hybrid functional based on PBE. 
Reproduced with permission from (41), copyright (2016) by IOP Publishing. 


and optimized within QMC for a fully internally 
consistent theory, but this has not yet been 
realized in the solid state, although it has been 
achieved in molecules (42, 43). 

Starting in the mid-2000s, the QMC commu- 
nity began to realize that modern hybrid den- 
sity functionals and simple variants of them 
could be a sufficiently accurate source of trial 
wave functions for transition metal oxides, with 
the variational principle used to select the best 
for each application. In these materials, the pre- 
dictive power of DFTs is questionable. Accurate 
QMC results for materials such as MnO (47) and 
FeO (48) eventually spurred calculations for the 
challenging parent phases of the copper oxide- 
based superconductors (49-51). Here the calcu- 
lations that are known to be most accurate 
through the variational principle were shown 
to predict magnetic exchange interactions and 
band gaps in best agreement with experimental 
data. Although inexact because of the limited 
flexibility in the input trial wave functions, these 
methods can now link fully atomistic structural 
models either to experimental data or to Hubbard 
model calculations, without empirical parame- 
ters. Similar techniques have been applied to the 
iron pnictide superconductors (52), the volume 
collapse in elemental cerium (46), and the phase 
transition in vanadium dioxide (53, 54). Promis- 
ingly, these results have been obtained by using 
simple trial wave functions, leaving open the pos- 
sibility of substantial improvements with more 
sophisticated and physically motivated choices. 
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Whereas the above-discussed techniques can 
be considered to work in real space, there are 
newer alternative approaches. FCIQMC (45) per- 
forms a stochastic sampling of the entire wave 
function on the basis of determinants. With near- 
exact but exponential scaling with increasing 
system size, current algorithms enable the study 
of simple solids (55), albeit with a very strong 
finite basis and finite size effects. AFQMC also 
makes use of a trial wave function, as does 
DMC, but uses a mathematically transformed 
Hamiltonian to enable sampling in terms of 
independent particles interacting with auxiliary 
fields. In materials, this sampling usually in- 
volves single-particle orbitals obtained from DFT. 
Compared with DMC, this method has the ad- 
vantages of increased simplicity and allowing 
substantial reuse of the computational machinery 
from mean-field methods, as there is no Jastrow 
factor. This also allows easier access to materials 
properties. Although there are not yet head-to- 
head comparisons in correlated materials, the 
performance of AFQMC compared with that of 
other QMC methods is high in model systems 
(56). As with DMC, improving trial wave functions 
is key because they act as an initial constraint. 

Although improved treatment of the fermion 
sign problem in all of these methods has long 
been recognized as an important challenge, they 
are also nearly exclusively formulated at zero 
temperature. Dynamics are little explored other 
than for the lightest elements of hydrogen and 
helium because of the high computational cost. 
Finite temperature properties of materials cur- 
rently have to be accessed via other theories. 


Representative applications: Total 
energies, magnetic exchange, and 
electron-phonon coupling 


Accurate calculations of total energies, free en- 
ergies, and enthalpies are essential for design- 
ing new materials. They enable evaluations of 
lattice constants, phonon energies, and forma- 
tion energies. Evaluating them with methods 
that go beyond the more standard implemen- 
tations of DFT has been the focus of multiple 
efforts. The accuracy and the relatively weak 
dependence on input parameters of QMC methods 
makes them an ideal tool for this task, in particular 
for light elements, where they have been used as a 
benchmark for different van der Waals density 
functionals (57-59). 

QMC methods can also be used to infer in- 
formation about excitations by arranging the 
excited-states wave function to be orthogonal to 
the ground-state wave function either by fixing 
different quantum numbers, such as spin, in the 
linear combination of Slater determinants in 
Eq. 3 or by enforcing a symmetry different from 
the ground state. Calculations for excitations 
must be performed individually, and in general 
complete spectra over a wide energy range are 
inaccessible. We illustrate recent successes of 
this method in correlated oxides in Fig. 5. 

Though the evaluation of total energies and 
free energies has long been a strength of QMC 
methods, progress has been made in the use of 
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Green’s function methods for these analyses. 
Variants of the GW approach (60, 67), LDA+G 
(30, 62) and LDA +DMFT (61, 63-66), have been 
used. The latter gives access to the effects of 
electronic entropies on the structural proper- 
ties. This effect can be very important in strongly 
correlated materials. The variants of LDA+G, 
in turn, have provided zero-temperature phase 
diagrams of several actinides (30, 62). 

Another important quantity is the electron- 
phonon coupling, which can be related to the rate 
of change of electronic excitations with changes 
in the atomic positions. In the past decade, its 
evaluation in linear response theory implemen- 
tations of the DFT has been carried out, the results 
of which have been successful in numerous com- 
pounds. For example, the dc resistivity as a function 
of temperature of the full elemental transition 
metal series has been explained without adjust- 
able parameters (67). 

Still, this approach sometimes fails in notable 
cases. One example is the high-temperature super- 
conductor Ba,Kq — x) 
BiOs, for which calcu- 
lations gave a value of 
i. = 0.3, which cannot ac- 
count for an observed 


“A hallmark of strongly 
correlated materials is the 


na. This requires accuracies that are out of reach 
today. The scheme could also lead to improve- 
ments in more practical faster methods (73), thus 
opening up correlated materials for materials 
design and optimization in the same way that 
weakly correlated materials are today. Some of 
the challenges, such as simply computing a suf- 
ficiently large number of atoms to capture the 
required physics, are greatly helped by the rapid 
advances in computational power, whereas other 
fundamental challenges require improved theories. 

Green’s functions and Monte Carlo methods 
are largely complementary. Green’s function 
methods are ideally suited to describe spectros- 
copies, as they target exactly the quantities that 
are probed in experiments (Fig. 4), whereas Monte 
Carlo methods target total energies, as they have 
weak parametric dependence (Fig. 5). 

The ranges of application of the different 
methods have recently begun to overlap, creating 
exciting opportunities. Combinations of methods 
advancing along the locality and PT axes of Fig. 2 
were suggested very 
early on, as in the GW+ 
DMFT method (74, 75), 
but only recently have 
self-consistent imple- 


superconducting transi- mentations been carried 
tion temperature of 35 K ° out, as in the MQSGW 
(68). Recently, Yin et al. Pp resence of P art ally fi lled (Matsubara quasipar- 
ida te d or f electronic shells that _ te seh annpeniee 
long-range Coulomb in- ee ° DMFT (76) me an 

teractions are important coop) articip ate str ongly m the multitier self con- 
in some high-temperature 7 Pad id sistent GW+EDMFT 
superconductors and that determining the material oe (extended dynamical 
the induced static corre- DI operties. mean-field theory) (77). 
lations increase the elec- Locality can be ex- 
tronic bandwidth and ploited in higher-order 


enhance the electron-phonon coupling. This 
correlation-enhanced electron-phonon coupling 
provides a natural explanation for the super- 
conductivity in this and related systems. Recent 
photoemission experiments have confirmed the 
main predictions of the theory (70). 

The iron pnictides and chalcogenides are Hund’s 
metals, a different class of strongly correlated 
materials where the correlations derive from 
Hund’s coupling rather than the Hubbard 
U. LDA+DMFT calculations for FeSe predicted 
a large enhancement (on the order of a factor of 
10), relative to its LDA value, of the coupling 
between some Fe-3d correlated states and one 
phonon mode, which corresponds to the symmet- 
ric modulation of the distance of the chalcogenide 
atoms from the iron planes (77). This LDA+DMFT 
prediction has recently been confirmed experi- 
mentally by using a novel technique at the x-ray 
free-electron laser (72). 


Outlook 


Buoyed by advances in concepts, algorithms, and 
supercomputing, we can contemplate an accu- 
rate fully ab initio approach to compute the 
properties of correlated materials with only con- 
trolled approximations. If realized, such a scheme 
would enable highly robust predictions for 
strongly correlated quantum-mechanical phenome- 


vertex functions and can be used to generate 
one-particle Green functions in methods such as 
dual fermions, dIA (78), and trilex (79). Incor- 
porating these ideas into realistic electronic 
structure is another direction to advance in the 
y - plane of Fig. 2 and is in the very early stages 
(80). The successrepresented in Fig. 1 in sys- 
tematically improving the estimation of semi- 
conductor gaps, together with the early suggestion 
of Zein et al. (81) that higher-order graphs are 
increasingly local, bodes well for this research 
direction. QMC methods have excellent access 
to the electron density, potentially allowing the 
inputs and outputs of DMFT calculations to be 
validated or improved. QMC methods now also 
have better access to excited states and are be- 
coming capable of diagnosing when GW meth- 
ods are best applied (82). Determining more 
accurately the effective Hubbard parameters 
that enter LDA+DMFT is another outstanding 
challenge to which QMC may contribute in the 
future (83). 

Green’s function methods that exploit locality 
are strongly dependent on the orbitals and pro- 
jectors that are used to carry out the infinite- 
order resummations, as well as on the Hubbard 
interaction tensor. Investigations of criteria for 
optimizing these orbitals, with the concomitant 
determination of the local Hubbard interaction 
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matrix, are needed. Although QMC approaches 
are less sensitive on parametric choices, there is 
also now a common need, within all the QMC 
approaches, for improved trial wave functions 
to treat stronger correlations and to treat more 
complex systems. To be applied near coupled elec- 
tronic, magnetic, or structural phase transitions— 
all hallmarks of strongly correlated materials—the 
methods must become independent of the mean- 
field theories and DFTs that are used as the 
foundation today. Current methods are expected 
to break down in these regimes, and cross- 
validation of new methods and new applications 
will be necessary (56). Encouragingly, the emer- 
gence of methods to generate multideterminant 
trial wave functions provides a practical mecha- 
nism to achieve this: selected configuration in- 
teraction can be used to produce systematically 
more accurate trial wave functions for DMC or 
AFQMC calculations; analysis of the results may 
be used for physical insight and for further tai- 
loring of the configuration interaction expansions 
for larger systems, while maximizing accuracy. 
Although perturbative methods improve the trial 
wave functions markedly faster than a naive con- 
figuration interaction expansion, physical insight 
to improve these methods specifically for the solid 
state will be highly beneficial. This will open up 
applications in elemental solids through to com- 
plex oxides, at ambient conditions through to 
conditions of high applied field or pressure. 

We have not mentioned, among wave func- 
tion methods, parallel progress within the field 
of quantum chemistry, where there are a range 
of complementary methods for treating electronic 
correlations in molecular systems [(84) and 
references therein]. Although traditionally imple- 
mented only for finite or isolated systems, these 
methods can be recast in the language of periodic 
boundary conditions needed to study materials in 
the solid state. This also applies to methods such 
as the coupled cluster, which have also recently 
been extended to periodic systems (55, 85). These 
approaches stand to bring a half century of 
theoretical chemistry developments and analy- 
sis techniques to correlated materials (86). Con- 
versely, ideas from DMFT are beginning to make 
their way into the realm of quantum chemistry 
(87-90). An important challenge for all of these 
methods is the use of frozen atomic core or pseu- 
dopotential approaches: Calculations actively 
using all electrons are too costly for general 
materials, whereas calculations that approxi- 
mate the treatment of core electrons risk introduc- 
ing significant errors. 

The need and potential effects for society of 
scientific investment in computational first- 
principles approaches for correlated electron ma- 
terials can hardly be overestimated. For weakly 
correlated electron systems, it took many years 
after the development of the DFT to reach the 
point where practical implementations of sev- 
eral algorithms using this method as an engine 
in user-friendly codes guided the discovery of 
the highest-temperature superconducting criti- 
cal temperature of a material, H3S under high 
pressure near 190 K (97, 92). For a historical ac- 
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count of the research leading to this break- 
through, see reference (93). It is our hope that 
some of the developments reviewed in this article 
will help bring this new research paradigm of 
theory-assisted materials design to the realm 
of strongly correlated solids. 
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REVIEW 


Biophysical experiments 
and biomolecular simulations: 


A perfect match? 


Sandro Bottaro and Kresten Lindorff-Larsen* 


A fundamental challenge in biological research is achieving an atomic-level description 
and mechanistic understanding of the function of biomolecules. Techniques for 
biomolecular simulations have undergone substantial developments, and their accuracy 
and scope have expanded considerably. Progress has been made through an increasingly 
tight integration of experiments and simulations, with experiments being used to 

refine simulations and simulations used to interpret experiments. Here we review the 
underpinnings of this progress, including methods for more efficient conformational 
sampling, accuracy of the physical models used, and theoretical approaches to integrate 
experiments and simulations. These developments are enabling detailed studies of 


complex biomolecular assemblies. 


n modern biological research, a key goal is 

to understand the functional consequences 

of structure, dynamics, and interactions of 

biological macromolecules. Proteins, lipids, 

carbohydrates, and nucleic acids interact, re- 
arrange, and modify their shape while effecting 
their various functions. Experimentalists face the 
daunting task of characterizing thermodynamic 
and kinetic properties of macromolecules in a 
complex environment. Computational simulation 
plays a role in these efforts, as modeling ap- 
proaches can aid in understanding experimental 
data and designing and predicting the outcome 
of future experiments. 

Here we review the progress and current 
challenges in computational modeling of bio- 
molecules, focusing on the topic of atomistic 
biomolecular simulations and the relationship 
between experiments and simulations. We high- 
light recent technological and theoretical advances 
in the field and consider whether there is a per- 
fect match between experiments and simula- 
tions. Disagreement between computation and 
experiment provides useful insights to further 
our understanding, and their complementary use 
yields a clearer picture than either does alone. 


Biomolecular simulations across length 
and time scales 


Experimentalists often collect data that must 
then be synthesized into a coherent model 
through inverse problem-solving. Computa- 
tional modelers deal with the forward problem: 
constructing a microscopic molecular model that 
can be compared with observed data (Fig. 1). In 
the best-case scenario, computational models are 
fully predictive and widely applicable. This would 
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correspond to a model that correctly predicts the 
physical behavior of a system—for example, the 
catalytic power and stability of an enzyme as a 
function of pH and temperature. Such perfect 
models do not exist, so variants are developed 
with distinct strengths and areas of application. 

Detailed information for reaction mechanisms 
and transition states in chemical reactions can 
be obtained via quantum mechanical (QM) cal- 
culations. These allow for simulation of the 
electronic properties of a subset of atoms within 
a macromolecule, which can be used to investi- 
gate bond cleavage and formation, distribution 
of charge and spin, and reaction mechanisms. 
Simulation of electronic properties of molecules 
requires a great deal of computational power, 
and thus the applicability of QM methods is, in 
general, limited to small systems or short time 
scales (1). Molecular dynamics (MD) simulations 
with empirical molecular mechanics force fields 
treat atoms as classical particles rather than 
considering their electronic structure: This ap- 
proximation makes it possible to study the struc- 
ture and dynamics of larger systems for longer 
periods of time, such as small proteins at the 
millisecond time scale. There are many relevant 
biological processes, however, that involve much 
larger biomolecular assemblies. The computa- 
tional complexity of these problems can be de- 
creased by grouping atoms together into single 
particles called beads. Such coarse-grained (CG) 
models range in resolution from one or a few 
beads per amino acid to one bead per hundreds 
or thousands of DNA bases. Despite their in- 
trinsic approximations, such models are essen- 
tial for tackling important problems in structural 
biology, including understanding complex for- 
mation between intrinsically disordered proteins 
(2) and rationalizing chromosome conformation- 
capture experimental data, thereby gaining in- 
sights into the internal chromosome organization 
(3). There also exist mixed, hybrid, and inter- 
mediate models that bridge together different 


resolutions. We focus below on the use of clas- 
sical MD simulations to study biomolecules. 


Challenges and opportunities for 
biomolecular simulations 


The successful use of MD simulations hinges on 
solving two distinct, yet related, problems: the 
“sampling problem” and the “force-field problem” 
(Fig. 2). The sampling problem refers to our ability 
to sample the relevant biomolecular configura- 
tions and to determine their relative populations. 
Exhaustive sampling is difficult to achieve, be- 
cause it is not possible to know in advance the 
required amount of sampling needed to calculate 
precise statistical averages. It is even difficult to 
assess whether a simulation is converged, because 
one may never know whether there are motions 
occurring on time scales beyond those sampled 
and robust and generally applicable tools to mon- 
itor convergence are needed (4, 5). Thus, active 
areas of development are theories, algorithms, 
and technological improvements to increase the 
precision of the simulations. 

The force-field problem refers to the construc- 
tion of the energy function that describes the 
physical interactions between atoms. Improve- 
ments in force fields thus increase the accuracy 
of simulations by providing a more realistic de- 
scription of the molecular interactions. Although 
progress on solving these two problems requires 
distinct approaches, they are tightly related. Only 
after taking into account all relevant conforma- 
tions, that is, those that contribute to thermo- 
dynamic averages, is it meaningful to calculate 
average quantities and compare them to experi- 
ments. Hence, our ability to improve force fields is 
tightly connected to improvements in sampling. 


Challenge 1: Improving physical models 


The first fundamental challenge in biomolecular 
modeling is the construction of the physical 
model itself. Trade-off between computer power 
and spatial or temporal resolution requires a 
choice of model, ranging from all-atom repre- 
sentations to CG (6) and ultra-CG models in 
which multiple residues or nucleotides are repre- 
sented by a single site (7). A long-standing goal 
of the field is to construct hybrid models that 
smoothly couple together different components 
at different resolutions (8). 

In atomistic MD simulations, interactions be- 
tween particles are modeled by “physics-based” 
terms that take into account chain connectivity, 
electrostatic interactions, London dispersion 
forces, and so on. Parameterized pairwise inter- 
action terms are fitted against QM calculations 
and experimental data to generate a force field 
that describes interactions between individual 
particles. Parameters like the equilibrium dis- 
tance between two covalently bonded atoms are 
known with high accuracy. Other parameters, 
such as partial charges, are difficult to establish, 
as they do not correspond to physical observables 
that can be directly probed through experiments. 

Accurate force-field parameterization for pro- 
teins has benefited from benchmarking and direct 
optimization of MD simulations with experimental 
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nuclear magnetic resonance (NMR) data on par- 
tially structured peptides (9, 10). Simulations of 
peptides that are 10 to 40 residues long are pos- 
sible to converge, yet can capture cooperative 
phenomena, such as helix-coil transitions or for- 
mation of small hydrophobic cores, which are 
difficult to parameterize from smaller molecules. 
Solution NMR experiments can provide residue- 
level information and are sensitive to the relative 
energies of conformations that correspond to local 
minima and have sizable populations. By optimiz- 
ing the backbone potential to match the experi- 
mentally measured helicity of a 15-residue peptide, 
as measured by NMR, a small change of about 
1kJ mol? was found to be sufficient to balance 
the secondary-structure populations (10). This 
small change in energy leads to a substantial im- 
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provement in accuracy, as the forces are accumu- 
lated over multiple residues and the populations 
scale to energies exponentially. Corrections to 
force fields obtained from examining short pep- 
tides are transferable among different structural 
classes of proteins and have improved models 
of folding processes for small globular and fast- 
folding proteins (J/, 12). 

Unfolded states and intrinsically disordered 
proteins (IDPs) have long appeared to be more 
compact and structured when observed in MD 
simulations than when observed through experi- 
ments. This discrepancy suggests that important 
physical effects were not modeled correctly in 
the simulations. Proposed solutions include im- 
proving the description of water or of protein- 
water interactions (13). These modifications have 
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Fig. 1. Simulations and experiments are complementary. (A) Solving an inverse problem aims to 
describe causal factors that produce a set of observations. Molecular simulations, conversely, can be used 
to construct a set of microscopic molecular conformations that can be compared with experimental 
observations through the use of a forward model. (B) Computational approaches to studying biomolecules 
range from detailed quantum mechanical models to atomistic molecular mechanics to coarse-grained 
models, where several atoms are grouped together. The decreased computational complexity granted by 
progressive coarse-graining makes it possible to access longer time scales and greater length scales. 
(C) Experimental data can be combined with physical models to provide a thermodynamic and kinetic 
description of a system. As the model quality improves, it becomes possible to describe more complex 
phenomena with less experimental data. SANS, small-angle neutron scattering; EPR, electron 
paramagnetic resonance; FRET, fluorescence resonance energy transfer; AG, Gibb's free energy. 


Bottaro et al., Science 361, 355-360 (2018) 


27 July 2018 


improved the accuracy of simulations of IDPs (14), 
which play important roles in biology and disease. 
A side effect of increasing protein-water inter- 
actions is, however, destabilization of folded 
proteins. In practice, one might thus have to 
choose between one family of force fields for 
simulations of folded proteins and a separate 
set for disordered systems, complicating studies 
of partially folded systems or of order-disorder 
transitions such as folding upon binding. To 
tackle this problem, it is necessary to consider 
simultaneously proteins that span from fully 
ordered to completely disordered and to test 
and optimize parameters at the same time on all 
of these systems. A comprehensive study of 
model systems with diverse properties has re- 
cently produced a force field capable of hand- 
ling both fully folded proteins and IDPs (15). 
In parallel to the development of models to 
study the structure and dynamics of proteins, there 
has been a growing interest in modeling nucleic 
acids, particularly RNA because of its catalytic 
and regulatory activities. Although important 
improvements have been made, state-of-the-art 
RNA force fields remain less accurate that those 
for polypeptides (16). Here, too, artifacts of MD 
simulations have been uncovered by direct com- 
parison against solution NMR data on small 
model systems (17). Similar to the case of IDPs, 
promising results have been obtained by balanc- 
ing water-RNA and RNA-RNA interactions (78). 
Systematic benchmarking of force fields against 
experiments has revealed a comforting trend: 
Force fields are getting better (72, 15, 19). It is 
worthwhile to note that these improvements have 
been possible even without substantial changes 
to the underlying model or mathematical func- 
tion used in the force fields. Thus, despite the in- 
herent simplicity and lack of, for example, taking 
polarization into account, it has been possible 
to improve force fields dramatically. Indeed, it 
is surprising that it is possible to parameterize 
force fields that work well across many different 
proteins and problems (20), and eventually, pro- 
gress will require models that are more complex. 
Improvements of force fields generally rely on 
ab initio QM calculations. Machine-learning ap- 
proaches, particularly neural networks, make it 
possible to train simple potentials with QM-level 
accuracy (27). Training is typically done on small 
molecules, and encouraging results have been 
obtained by transferring these potentials to the 
study of larger organic molecules (22). Force fields 
that explicitly include polarization effects are also 
likely to benefit from automated methods for 
integrated parameterization from experiments 
and QM calculations and from improvements in 
software and algorithms from sampling with these 
potentials. Here, Bayesian methods for optimizing 
force fields against experimental data and QM 
calculations are expected to play an even larger 
role, by enabling systematic balancing of differ- 
ent sources of information (23-25). 


Challenge 2: Accessing long time scales 


Atomistic biomolecular MD simulations are in- 
herently costly, owing to the need to model forces 
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between tens or hundreds of thousands of in- 
dividual atoms or more. These forces are eval- 
uated every few femtoseconds of simulation 
time, requiring about a billion steps to simulate 
a molecule for a microsecond. Although the 
speed of a simulation depends strongly on the 
size of the biomolecular system and the available 
computational resources, it is not uncommon 
to require weeks or months of computer time 
with hundreds or thousands of processors work- 
ing simultaneously to obtain microsecond- 
length simulations. 

Conceptually, the most straightforward means 
to increase the speed and throughput of molec- 
ular simulations is perhaps “simply” to decrease 
the time it takes to perform a single iteration of 
the simulation. Widely used software packages 
designed for biomolecular simulations, such as 
GROMACS (26), NAMD (27), Desmond (28), 
AceMD (29), and AMBER (30), use different 
levels of parallelization by taking advantage 
of multicore processors and high-performance 
computing facilities. Speedups can be achieved 
by off-loading calculations to graphics processing 
units, which provide high performance at rea- 
sonable cost. A different route to improve ef- 
ficiency is to build hardware specifically adapted 
to molecular simulations such as MDGRAPE (3D 
and Anton (32). For example, Anton is a massively 
parallel supercomputer designed to perform fast 
and accurate simulations of biomolecules by 
simultaneously considering all parts of the calcu- 
lations, including MD-specific integrated circuits 
for calculating the costly parts of the force-field 
interactions, a specialized communication net- 
work tailored to match the periodic boundary 
conditions used in simulation, and special par- 
allelization algorithms developed for this archi- 
tecture. Anton enabled the first millisecond-length 
all-atom MD simulation of a globular protein (32). 
Its successor, Anton 2, is optimized for larger 
biomolecular systems and can perform multi- 
microsecond simulations in a single day for sys- 
tems such as a small virus or a solvated ribosome 
with more than 1 million atoms (33). 

Massive parallelization has been exploited 
in the folding@home project, which utilizes 
hundreds of thousands of “stand-by” machines 
all over the globe (34). Such distributed com- 
puting studies may now reach multiple milli- 
seconds of aggregate simulation time and consist 
of hundreds or thousands of simulations ranging 
from hundreds of nanoseconds to a few micro- 
seconds (35). Because each simulation may be 
much shorter than the time scales of interest, a 
key problem is how to extract information about 
slow, long-time scale processes from a combined 
analysis of many short simulations. One possible 
solution to this problem is to build a Markov 
state model (MSM) (35, 36), which enables one 
to construct a “memoryless transition network” 
describing the populations and kinetics of in- 
terconversion between different conformational 
states. In recent years, MSMs have gained wide- 
spread use, thanks to improved algorithms and 
software (37, 38) and several successful applica- 
tions to biomolecular processes, including pro- 
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Fig. 2. Sampling and accuracy in molecular simulations. An MD simulation samples the temporal 
evolution of molecular configurations, but sampling is, in practice, limited to a finite time (t = tsim). 
(A) When the simulation time is much longer than the slowest time scales of motions, many 
transitions are observed between the relevant conformational states. (B) When such simulations are 
performed with an accurate force field, statistical averages are converged and will be close to 
experimental values, and the averages approach the infinite time-scale average. (C) By contrast, 
when the force field is inaccurate, converged simulations give rise to precise, but inaccurate, results. 
(D) When the simulation time is too short compared to the time scales of the system, it is difficult to 
calculate precise quantities. (E) In this case, one may get disagreement between experiment and 
simulation even when the force field is accurate. (F) The worst situation, when sampling is 
insufficient and the force field is inaccurate. 


tein folding, ligand binding, and protein-protein 
association (35). Path-based methods such as 
transition path sampling (39) and milestoning 
(40) also use many short simulations to study 
kinetics and mechanisms of long-time scale 
processes. These and related methods exploit 
the fact that many conformational transitions 
are “rare events,” for which the time it takes to 
cross the barrier is substantially shorter than 
the waiting time between such events. 
Sampling may also be enhanced by changing 
simulation parameters. Increasing the tempera- 
ture increases the kinetic energy, making barrier- 
crossing events faster (41). This idea is at the basis 
of parallel tempering, perhaps the most widely 
used enhanced sampling approach. These al- 
terations can be viewed as enhancing simula- 
tions along a progress variable, also known as a 
reaction coordinate or collective variable (CV), 
in this case related to the energy of the system. 
For some problems, rapid fluctuations of the 
energy, and similar energies in different distinct 
conformational states, mean that increased 
temperature does not transfer into efficient 
sampling. This problem can be exacerbated by 
the fact that the available conformational space 


at high temperatures is larger, so that increased 
rates of sampling are more than offset by the 
increased volume of conformational space. Ac- 
celerated MD may instead be used to “boost” 
the energy along internal degrees of freedom, 
such as the backbone dihedral angles, thus en- 
hancing the ability to cross local barriers (42). 

Enhancing sampling along one or more pre- 
specified CVs that describe the process of interest 
is another widely used strategy (43). In a protein- 
folding simulation, the number of native contacts 
formed or the progress along an initial guess 
of the folding path might be used to guide the 
simulation, even if the path is imperfect, and 
thus provide detailed insight into the folding 
free-energy landscape. Metadynamics (44) uses 
a time-dependent potential to simultaneously 
enhance sampling and construct a free-energy 
profile along such CVs and is widely used both 
because of its applicability to a range of problems 
(e.g., biomolecular processes, molecular docking, 
chemical reactions, crystal growth, and proton 
diffusion) and the availability of efficient and 
easy-to-use software (45). 

Long unbiased simulations performed with 
Anton represent a useful reference to benchmark 
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and validate enhanced sampling methods and 
kinetic models. In such applications, one may 
compare a specific protocol for enhanced sam- 
pling or constructing a kinetic model with the 
results from an unbiased simulation with the 
same force field to focus on benchmarking the 
algorithms and avoiding complications from 
force-field uncertainty (46, 47). 

Approaches based on CVs are very powerful, 
but their optimal choice is a critical and non- 
trivial step. For complex biomolecular rear- 
rangements, it is difficult to identify CVs that 
correspond to the relevant, slowly varying de- 
grees of freedom. In this respect, deep learning 
approaches have recently been used to identify 
improved CVs (48). CVs are not only useful to 
enhance sampling: They are essential to ratio- 
nalize the large amount of complex data gen- 
erated in MD simulations. New approaches to 
create better low-dimensional representations 
of high-dimensional data are also useful to 
construct improved MSMs (35), and we expect 
the advances in machine-learning methods 
[e.g., low-dimensional embedding and cluster- 
ing (49, 50)] to play an increasingly important 
role in this field. 


Challenge 3: Integrating experiments 
and simulations 


Although simulations and statistical mechanical 
theories are important and very powerful in 
their own right, direct integration of experimental 
data with molecular simulations can provide a 
rich description of the structure and dynamics 
of biomolecules. This field—also called integrative 
structural biology (57)—has benefited from recent 
technological advances in cryo-electron micros- 
copy (cryo-EM) and is particularly important for 
studies of complex, dynamic systems for which 
multiple structural techniques provide com- 
plementary information. Formally, the problem 
consists of determining the three-dimensional 
structure or, more generally, an ensemble of 
molecular conformations and their associated 
weights, which are compatible with a set of ex- 
perimental observations. 

One strategy is to modify the simulation to 
match experimental data (Fig. 3). In this case, 
the force field is not considered a fixed, im- 
mutable model but instead a fitting function 
to be adjusted by experimental data. Indeed, 
this “pseudoenergy” approach underlies most 
structure-determination algorithms in which 
a physical energy function (often a simplified 
force field) is combined with an “experimental 
energy function” that measures the deviation 
between experiment and simulation (52). These 
integrative approaches enable accurate protein- 
structure determination when using chemical 
shifts (53) (Fig. 3A) or when using sparse, un- 
certain, and ambiguous experimental data (54). 
Similar approaches have been developed with 
the aim of providing molecular models of large 
molecular complexes constructed by using di- 
verse sets of experimental data, including cross- 
linking, small-angle x-ray scattering (SAXS), and 
cryo-EM images (55, 56). The availability of dif- 
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Fig. 3. Experimentally driven simulations. 
(A) Probability distribution of the structural sim- 
ilarity to the native structure of a protein deter- 
mined by using a simplified force field (blue) or 
when the same force field is combined with NMR 
chemical-shift restraints (green) (53). RMSD, root 
mean square deviation. (B) A representative three- 
dimensional structure from the restrained 
simulation (green) and a reference structure 
(black). (C) In a conventional restrained simulation, 
the probability distribution of a measured quantity 
obtained by sampling using the force field alone 
(blue) is modified by adding an additional energy 
term that enforces the agreement with experi- 
mental data (green). In the resulting ensemble, all 
individual molecular conformations are close to the 
experimental average. (D) When heterogeneous 
conformations give rise to the measured average 
value (e.g., scalar couplings for different rotameric 
states), adding the experimental restraints to push 
individual conformation close to the experimental 
value is not correct, as this forces the simulation to 
structures that may not represent any of the 
relevant states. In maximum-entropy approaches, 
the experimental data are satisfied by introducing a 
minimal perturbation to the simulation ensemble. 
In this simplified example, the solution is a small 
shift in the populations of the two states, which 
results in a calculated average (red dashed line) 
compatible with the experiment. 


ferent sources of complementary experimental 
data is, in this context, important, as it allows 
one to cross-validate results and avoid overfitting. 

Structural experiments such as SAXS, NMR, 
and x-ray diffraction report on quantities aver- 
aged over many molecules and long periods of 
time. For rigid molecules, the error may be small 
when interpreting ensemble-averaged quantities 
for individual structures. However, dynamical 
averaging is crucial when studying flexible 
molecules, such as IDPs or single-stranded RNA, 
because the structural interpretation of exper- 
imental data must be addressed by considering 
the coexistence of multiple conformational states 
(Fig. 3). One theoretical approach for dealing 
with the averaging problem is based on the 
maximum-entropy principle (52). The basic idea 
is to introduce a perturbation to the conforma- 
tional ensemble generated by simulations in 
order to match a set of experimental data. The 
perturbation should be as small as possible: 
Mathematically, this is achieved by maximizing 
a quantity called relative Shannon entropy, hence 
the name maximum entropy. Thus, a minimal 
modification to the simulations to match the 
experimental data results in the least-biased 
combination of the force field and the experimen- 
tal measurements. In practice, these approaches 
can remove much of the uncertainty associated 
with the choice of force fields so that conforma- 
tional ensembles derived by combining experi- 
ments and simulations are more similar than 
ensembles derived solely from simulations (5, 57). 

Although the maximum-entropy principle pro- 
vides a coherent framework to obtain conforma- 
tional ensembles that combine force fields and 
experimental data, the basic formalism does not 
take sources of error into account. Another im- 
portant development has thus been theory that 
considers not only experimental measurements 
but also the associated uncertainty. When com- 
bining data from multiple experimental tech- 
niques, uncertainties are essential to set the correct 
weights among them. For some sources of ex- 
perimental data—for example, chemical shifts 
from NMR spectroscopy—the measurement itself 
is extremely precise, but our ability to relate the 
experimental quantity to molecular structure (i.e., 
the forward model that is used to calculate ex- 
perimental quantities from three-dimensional 
structures) is associated with substantial uncer- 
tainty. Both experimental and modeling uncer- 
tainty can be treated by using Bayesian approaches 
such as those used in inferential structural deter- 
mination protocols, leading to improved precision 
and a rigorous approach to integrate multiple 
sources of experimental data (58). 

Combined Bayesian-maximum entropy inte- 
grative methods that consider uncertainty and 
averaging offer a promising route to reconstruct 
the conformational variability of complex biomo- 
lecular systems (59, 60). These methods can be 
used with all-atom simulations or with CG mod- 
els for larger assemblies. For instance, the struc- 
ture and allosteric mechanism of a protein kinase 
were revealed by reweighting CG simulations 
using SAXS experimental data (67). An alternative 
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approach is to construct an MSM that has also 
been biased by using experimental data (62). 

An important challenge is how the informa- 
tion gleaned from these studies may be fed back 
into improved force fields—for example, by sys- 
tematically analyzing differences between the 
experimentally restrained ensembles and those 
obtained from the models alone. For instance, 
we recently identified a specific dihedral angle 
in the RNA backbone whose distribution in MD 
simulations was markedly different from that 
found when reweighting the same simulations 
using a Bayesian-maximum entropy approach 
(63). This observation suggested that force-field 
errors for this specific term could explain part 
of the disagreement between experiment and 
simulations, and, indeed, parallel work on im- 
proving RNA force fields resulted in distribu- 
tions for this dihedral angle that were in much 
better agreement with the experimentally derived 
results (18, 63). 

The discussion above pertains to experimental 
data that can be related to equilibrium proper- 
ties and that can be represented by population- 
weighted averages over individual conformations 
in the ensemble. For example, distances probed 
via nuclear Overhauser effect (NOE) NMR ex- 
periments are typically calculated from the aver- 
age of the inverse sixth power of the distances in 
each individual structure (58). In reality, NOEs 
and many other experimental quantities depend 
on kinetic properties that need to be taken into 
account for the most accurate calculations. Re- 
cent theoretical and practical advances make it 
possible to construct conformational ensembles 
also based on such information (62, 64-66) and 
thus extend applications to new sources of ex- 
perimental data. 


Conclusions and outlook 


The complexity of biological systems often man- 
dates the combined use of multiple techniques, 
including biomolecular simulations. Clearly, sim- 
ulations are not ordinary experiments and often 
require a detailed knowledge of algorithms, un- 
derlying assumptions, and tricks that can be 
difficult to access and understand for non- 
specialists. Much progress has been made on 
making these tools more user-friendly and ac- 
cessible, though analyzing simulations often re- 
quires specialist knowledge. With a wide range 
of tools available, it is important to balance pre- 
cision and accuracy when deciding on a sim- 
ulation strategy (sampling method, force field, 
and level of resolution): What level of detail is 
relevant to the problem at hand, what are the 
relevant time scales, and can I address imper- 
fections in the model by, for example, experi- 
mental restraints? 

Substantial improvements in force fields have 
been made possible by using data from experi- 
mental studies on systems that are large enough 
to capture complex behavior yet simple enough 
to converge simulations. Future progress requires 
that experimentalist and computational chemists 
continue to work together to design experiments 
that are best suited to optimize force fields and 
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to isolate properties that current models fail to 
describe (9, 17). By testing and optimizing models 
broadly across different classes of problems and 
molecules, it will be possible to create force fields 
that are more transferable. Eventually, we will 
have to go beyond the current simple functional 
forms (67, 68), but a surprising observation has 
been how much force fields could be improved by 
careful parameter optimization on an increasingly 
broad set of QM and experimental data. When 
reading the simulation literature, one should thus 
check whether a carefully validated force field 
has been used. Judging this is helped by the in- 
creased availability of systematic comparisons 
on a broad range of systems (72, 15, 19, 69). 

Further, as it remains difficult to sample con- 
formational space sufficiently, particularly for 
complex systems, one should check whether 
convergence has been assessed and whether 
quantitative differences are backed up by suf- 
ficient sampling. This is inherently difficult 
because it is much easier to prove lack of con- 
vergence than the opposite (70). Nevertheless, 
useful questions to ask include (i) whether the 
same events are observed multiple times, (ii) 
the simulations are longer than the correlation 
times and the statistical analyses take time 
correlation into account, and (iii) whether the 
observed effects are greater than the statistical 
uncertainty. 

We must, however, also be pragmatic in the 
way simulations are used. Like experiments, sim- 
ulations are not perfect, and we will continue to 
live with uncertainty in sampling and force fields. 
Here the integration between experiment and 
simulations can help alleviate problems in 
both accuracy and sampling. We envision that 
these methods will play an increasingly impor- 
tant role in studying the relationship between 
structure and dynamics of large biomolecular 
assemblies or highly flexible molecules. The link 
between molecular simulations and cryo-EM, 
inherently a single-molecule technique, might 
be particularly fruitful for looking at conforma- 
tional dynamics at high spatial resolution (77, 72). 
Much can also be gained by carefully choosing 
systems that are amenable to both experimental 
and computational analysis. Recent examples 
include elucidating the molecular details that 
underlie the alternating access mechanism in a 
minimal sugar transporter (73) and an atomic- 
level description of interactions that lead to barrier 
roughness in protein folding (74). 

The overwhelming growth of sequence data 
also presents new opportunities for computa- 
tional chemists seeking to understand macro- 
molecular structure and function. Evolution is, 
after all, governed by the same physical forces 
that simulations are constructed to model. One 
point of convergence has been the use of evolu- 
tionary records to construct statistical models 
of amino acid sequences (75, 76). Conversely, 
computational biophysics can guide interpreta- 
tions of what mutations do to proteins when 
analyzing exome sequencing for patient diag- 
nosis (76-78). Finally, large-scale deep mutational 
scanning experiments can provide comprehen- 


sive maps of the mutational effects on protein 
stability across entire proteins (79) and enable a 
deeper understanding and benchmarking of our 
ability to predict the consequences of mutations 
(76, 80). 

We thus anticipate that simulations will even- 
tually be commonplace when studying the effect 
of drugs and mutations and will play an essential 
role in the future of bioengineering in the same 
way that computer modeling is used today in 
computational prototyping of cars and buildings. 
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REVIEW 


Inverse molecular design using 
machine learning: Generative models 
for matter engineering 


Benjamin Sanchez-Lengeling’ and Alan Aspuru-Guzik”’®*** 


The discovery of new materials can bring enormous societal and technological progress. In this 
context, exploring completely the large space of potential materials is computationally 
intractable. Here, we review methods for achieving inverse design, which aims to discover 
tailored materials from the starting point of a particular desired functionality. Recent advances 
from the rapidly growing field of artificial intelligence, mostly from the subfield of machine 
learning, have resulted in a fertile exchange of ideas, where approaches to inverse molecular 
design are being proposed and employed at a rapid pace. Among these, deep generative models 
have been applied to numerous classes of materials: rational design of prospective drugs, 
synthetic routes to organic compounds, and optimization of photovoltaics and redox flow 
batteries, as well as a variety of other solid-state materials. 


any of the challenges of the 21st century 

(1), from personalized health care to 

energy production and storage, share a 

common theme: materials are part of 

the solution (2). In some cases, the solu- 
tions to these challenges are fundamentally 
limited by the physics and chemistry of a ma- 
terial, such as the relationship of a materials 
bandgap to the thermodynamic limits for the 
generation of solar energy (3). 

Several important materials discoveries arose 
by chance or through a process of trial and error. 
For example, vulcanized rubber was prepared in 
the 19th century from random mixtures of com- 
pounds, based on the observation that heating 
with additives such as sulfur improved the 
rubber’s durability. At the molecular level, in- 
dividual polymer chains cross-linked, forming 
bridges that enhanced the macroscopic mechan- 
ical properties (4). Other notable examples in 
this vein include Teflon, anesthesia, Vaseline, 
Perkin’s mauve, and penicillin. Furthermore, 
these materials come from common chemical 
compounds found in nature. Potential drugs 
either were prepared by synthesis in a chem- 
ical laboratory or were isolated from plants, 
soil bacteria, or fungus. For example, up until 
2014, 49% of small-molecule cancer drugs were 
natural products or their derivatives (5). 

In the future, disruptive advances in the dis- 
covery of matter could instead come from unex- 
plored regions of the set of all possible molecular 
and solid-state compounds, known as chemical 
space (6, 7). One of the largest collections of 
molecules, the chemical space project (8), has 


‘Department of Chemistry and Chemical Biology, Harvard 
University 12 Oxford Street, Cambridge, MA 02138, USA. 
Department of Chemistry and Department of Computer 
Science, University of Toronto, Toronto Ontario, M5S 3H6, 
Canada. *Vector Institute for Artificial Intelligence, Toronto, 
Ontario M5S 1M1, Canada. “Canadian Institute for Advanced 
Research (CIFAR) Senior Fellow Toronto, Ontario M5S 1M1, 
Canada. 

*Corresponding author. Email: aspuru@utoronto.ca 


Sanchez-Lengeling et al., Science 361, 360-365 (2018) 


mapped 166.4 billion molecules that contain at 
most 17 heavy atoms. For pharmacologically rele- 
vant small molecules, the number of structures is 
estimated to be on the order of 10 (9). Adding 
consideration of the hierarchy of scale from sub- 
nanometer to microscopic and mesoscopic fur- 
ther complicates exploration of chemical space 
in its entirety (10). Therefore, any global strategy 
for covering this space might seem impossible. 
Simulation offers one way of probing this 
space without experimentation. The physics 
and chemistry of these molecules are governed 
by quantum mechanics, which can be solved via 
the Schrodinger equation to arrive at their ex- 
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act properties. In practice, approximations are 
used to lower computational time at the cost of 
accuracy. 

Although theory enjoys enormous progress, 
now routinely modeling molecules, clusters, and 
perfect as well as defect-laden periodic solids, the 
size of chemical space is still overwhelming, and 
smart navigation is required. For this purpose, 
machine learning (ML), deep learning (DL), and 
artificial intelligence (AI) have a potential role 
to play because their computational strategies 
automatically improve through experience (JJ). 
In the context of materials, ML techniques are 
often used for property prediction, seeking to 
learn a function that maps a molecular material 
to the property of choice. Deep generative models 
are a special class of DL methods that seek to 
model the underlying probability distribution of 
both structure and property and relate them in a 
nonlinear way. By exploiting patterns in massive 
datasets, these models can distill average and 
salient features that characterize molecules (12, 13). 

Inverse design is a component of a more 
complex materials discovery process. The time 
scale for deployment of new technologies, from 
discovery in a laboratory to a commercial pro- 
duct, historically, is 15 to 20 years (14). The pro- 
cess (Fig. 1) conventionally involves the following 
steps: (i) generate a new or improved material 
concept and simulate its potential suitability; (ii) 
synthesize the material; (iii) incorporate the ma- 
terial into a device or system; and (iv) characterize 
and measure the desired properties. This cycle 
generates feedback to repeat, improve, and re- 
fine future cycles of discovery. Each step can take 
up to several years. 

In the era of matter engineering, scientists 
seek to accelerate these cycles, reducing the 
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Fig. 1. Schematic comparison of material discovery paradigms. The current paradigm is 
outlined at left and exemplified in the center with organic redox flow batteries (92). A closed-loop 
paradigm is outlined at right. Closing the loop requires incorporating inverse design, smart software 
(93), AI/ML, embedded systems, and robotics (87) into an integrated ecosystem. 
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time between steps. The ultimate aim is to 
concurrently propose, create, and characterize 
new materials, with each component transmitt- 
ing and receiving data simultaneously. This 
process is called “closing the loop,” and inverse 
design is a critical facet (12, 15). 


Inverse design 


Quantum chemical methods reveal properties 
of a molecular system only after specifying the 
essential parameters of the constituent atomic 
nuclei and their three-dimensional (3D) coor- 
dinate positions (J6). Inverse design, as its name 
suggests, inverts this paradigm by starting with 
the desired functionality and searching for an 
ideal molecular structure. Here the input is the 
functionality and the output is the structure. 
Functionality need not necessarily map to one 
unique structure but to a distribution of prob- 
able structures. Inverse design (Fig. 2) uses 
optimization, sampling, and search methods 
to navigate the manifold of functionality of 
chemical space (17, 18). 

One of the earliest efforts in inverse design 
was the methodology of high-throughput virtual 
screening (HTVS). HTVS has its roots in the 
pharmaceutical industry for drug discovery, where 
simulation is an exploratory tool for screening a 
large number of molecules (9, 20). HTVS starts 
with an initial library of molecules built on the 
basis of researchers’ intuition, which narrows 
down the pool of possible candidate molecules 
to a tractable range of a thousand to a million. 
Initial candidates are filtered on the basis of 
focused targets such as ease of synthesis, sol- 
ubility, toxicity, stability, activity, and selectivity. 
Molecules are also filtered by expert opinion, 
eventually considered as potential lead com- 
pounds for organic synthesis. Successful motifs 
and substructures are further incorporated in 
future cycles to further optimize functionality. 

Although HTVS might seem like an ensemble 
version of the direct approach for material 
design, it differs in its underlying philosophy 
(4). HTVS is focused on data-driven discovery, 
which incorporates automation, time-critical per- 
formance, and computational funnels; promising 
candidates are further processed by more ex- 
pensive methodologies. A crucial component is 
feedback between theory and experiment. 

The HTVS methodology has been quite suc- 
cessful at generating new and high-performing 
materials in other domains. In organic photo- 
voltaics, molecules have been screened on the 
basis of frontier orbital energies and photovoltaic 
conversion efficiency and orbital energies (21-24). 
In organic redox flow batteries, redox potential, 
solubility, and ease of synthesis (25, 26) are 
prioritized. For organic light-emitting diodes, 
molecules have been screened for their singlet- 
triplet gap and photoluminescent emission (27). 
Massive explorations of reactions for catalysis 
(28) or redox potentials in biochemistry have 
been undertaken (28). For inorganic materials, 
the Materials Project (29) spawns many appli- 
cations such as dielectric and optical materials 
(30), photoanode materials for generation of 
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Fig. 2. Schematic of the different approaches toward molecular design. Inverse design starts 
from desired properties and ends in chemical space, unlike the direct approach that leads from 


chemical space to the properties. 


chemical fuels from sunlight (37), and battery 
electrolytes (32). 

Arguably, an optimization approach is pref- 
erable to HTVS because it generally visits a 
smaller number of configurations when ex- 
ploring the manifold of functionality. An op- 
timization incorporates and learns geometric 
information of the functionality manifold, guided 
by general trends, directions, and curvature (17). 

Within discrete optimization methods, Evolu- 
tion Strategies (ES) is a popular choice for global 
optimization (33-35) and has been used to map 
chemical space (36). ES involves a structured 
search that incorporates heuristics and proce- 
dures inspired by natural evolution (37). At each 
iteration, parameter vectors (“genotypes”) in a 
population are perturbed (“mutated”) and their 
objective function value (“fitness”) evaluated. 
ES has been likened to hill-climbing in high- 
dimensional space, following the numerical 
finite difference across parameters that are more 
successful at optimizing the fitness. With ap- 
propriately designed genotypes and muta- 
tion operations, ES can be quite successful at 
hard optimization problems, even overcoming 
state-of-the-art machine learning approaches (38). 

In other cases, inverse design is realized by 
incorporating expert knowledge into the op- 
timization procedure, via improved Bayesian 
sampling with sequential Monte Carlo (39), 
invertible system Hamiltonians (J8), deriving 
analytical gradients of properties with respect 
to a molecular system (40), optimizing potential 
energy surfaces of chemical systems (41), or dis- 
covering design patterns via data-mining tech- 
niques (42, 43). 

Finally, another approach involves generative 
models stemming from the field of machine 
learning. Before delving into the details, it is 
appropriate to highlight the differences between 
generative and discriminative models. A dis- 
criminative model tries to determine conditional 
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probabilities (p(y|x)): that is, the probability of 
observing properties y (such as the bandgap 
energy or solvation energy), given 2 (a mole- 
cular representation). By contrast, a generative 
model attempts to determine a joint probability 
distribution p(a, y): the probability of observing 
both the molecular representation and the phys- 
ical property. By conditioning the probability 
on a molecule (x) or a property (y), we retrieve 
the notion of direct (p(y|z)) and inverse design 
(p@ly)). 

As expected, deep generative models are more 
challenging to create than direct ML approaches, 
but DL algorithms and computational strategies 
have advanced substantially in the last few years, 
producing astonishing results for generating 
natural-looking images (44), constructing high- 
quality audio waveforms containing speech (45), 
generating coherent and structured text (46), 
and most recently, designing molecules (47). 
There are several ways of building generative 
models, but for the purposes of this Review, we 
will focus on three main approaches: variational 
autoencoders (VAEs) (48), reinforcement learn- 
ing (RL) (49), and generative adversarial net- 
works (GANS) (44). 

Before describing how each approach differs, 
we consider representations of molecules, which 
in turn determine the types of tools available and 
the types of information that can be exploited in 
the models. 


Representation of molecules 


To model molecular systems accurately, we must 
solve the Schrédinger equation (SE) for the 
molecular electronic Hamiltonian, from which 
we obtain properties relating to the energy, geom- 
etry, and curvature of the potential energy 
surface of our system. In the SE, the molecule 
is represented as a set of nuclear charges and 
the corresponding Cartesian coordinates of 
the atomic positions in 3D space. Meanwhile, 
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Fig. 3. Different types of molecular representations applied to one 
molecule, AQDS, which is used in the construction of organic redox 
flow batteries. Clockwise from top: (1) A fingerprint vector that 
quantifies presence or absence of molecular environments; (2) SMILES 
strings that use simplified text encodings to describe the structure of 


ML algorithms benefit from having representa- 
tions that expose more easily constraints and 
properties of the physics of interest, so a 3D 
representation might not be the most efficient. 
Having a more direct representation allows the 
model to spend fewer computational resources 
learning patterns from first principles. A repre- 
sentation that can span all of chemical space 
should ideally capture all the symmetries of the 
SE: permutational, rotational, reflectional, and 
translational invariance for particles of the same 
type (50). Convolutions, Fourier transforms, and 
determinants are some of the mathematical 
structures that can preserve these symmetries 
and are often incorporated into the representa- 
tion or model (5/7, 52). Molecular representation 
is a current open research problem; there are 
many representations, and no one representa- 
tion seems to work for all properties (53). 
Current molecular representations fall into 
three broad categories: discrete (e.g., text), con- 
tinuous (e.g., vectors and tensors), and weighted 
graphs. Although graphs can be represented as 
sparse matrices, they differ fundamentally in 
how they are processed within a model. Typically 
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1 Fingerprints 


2 SMILES 


Molecule 


5 Coulomb matrix 


electronic density. 


a representation will have a fixed length size 
via padding or the addition of dummy atoms. For 
inverse design, a desired property is invertibility— 
the capability to map back to a molecule struc- 
ture that can then potentially be synthesized 
and characterized. Alternatively, if not invertible, 
it would be sufficient to have an ideal target 
representation and then either scan or evolve a 
molecule to match in a fast manner. Among 
invertible representations, we find molecular 
graphs and Hamiltonians. 

Graphs are a natural representation of mol- 
ecules. Following empirical principles of bond- 
ing, a molecule is interpreted as an undirected 
graph where each atom is a node and the bonds 
are the edges. To reduce complexity, hydrogen 
atoms are treated implicitly because they are 
deduced from standard chemistry valence rules. 
One standard for molecular graphs is SMILES 
strings (54), ID text encodings that follow a par- 
ticular grammar syntax. More advanced repre- 
sentations forgo the text encoding and use a 
weighted graph representation, with a variety of 
vectorized features on edges and nodes such as 
bonding type, aromaticity, charge, and distance. 
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a chemical species; (3) potential energy functions that could 

model interactions or symmetries; (4) a graph with atom and bond 
weights; (5) Coulomb matrix; (6) bag of bonds and bag of fragments; 
(7) 3D geometry with associated atomic charges; and (8) the 


(55-57). Graphs are normally not uniquely repre- 
sented, which can be advantageous for data aug- 
mentation (58) or disadvantageous when this 
representation degeneracy introduces noise to a 
model (53). 

Whereas Hamiltonians rely only on the known 
physics and atomic constants of a molecule, 
the Coulomb matrix representation is based 
on Coulombic forces between charges of each 
atom (59). When combined via concatenation, 
summation, or differences, these base representa- 
tions represent reactions (60), molecular en- 
sembles, or conformers. 

Other representations are better suited to pre- 
diction and could be rendered invertible via lookup 
tables: bag of bonds (62, amons (62), fingerprints 
(63, 64), electronic density (57), symmetry functions 
(65), and chemical environments (50). Figure 3 
shows several of these representations. 


Generative models for exploring 
chemical space 


Molecular representations are often inputs for 
deep neural network (DNN) models. The original 
data are transformed across several stages (layers), 
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usually by a linear transformation followed by 
a nonlinear function. For a given task and as- 
sociated loss function, parameters for each layer 
(weights) are optimized via the backpropagation 
algorithm. When optimized, each intermediate 
(hidden) representation will tend to capture high- 
or low-level transformed features of the original 
data. In this sense, DL is a form of representation 
learning (66) because the DNN architecture is op- 
timized to transform the original data into another 
representation that is more efficient for a given 
task such as regression, classification, or generation. 

By attaching additional structure to the hidden 
representation, either in the form of statistical 
priors or probability distributions, we arrive at 
the idea of latent variable models. Each observed 
datum (molecule) has a corresponding latent 
representation, often a vector, within a latent 
variable space that encodes the relevant semantic 
features of the data. 

The goal of a generative model is to model a 
data distribution, by training a model on large 
amounts of data and attempting to generate data 
like it. The loss function encodes the notion of 
likeness, measuring the differences between two 
distributions, the empirically observed and the 
generated. 

We center most of our discussion on deep gen- 
erative models using SMILES as a representation. 
Nonetheless, many of these approaches are quite 
general and are applicable to other representa- 
tions. We expect future work to extend these ar- 
chitectures toward other molecular representations. 

For the generation of sequences, recurrent 
neural networks (RNNs) (46, 67) serve as a common 
starting point, creating sequences incrementally 
one step at a time and predicting what comes 
next. RNNs can be augmented to take into account 
complex time-dependent patterns with long short- 
term memory cells (67, 68) (LSTMs), and attention 
and memory mechanisms (69). Figure 4 displays 
several architectures for generative models. 


Variational autoencoders, reinforcement 
learning, and adversarial training 


Besides generation tasks, for inverse design the 
generative process must be controlled or biased 
toward desirable qualities. With VAEs, the op- 
timization of properties is performed explicitly 
over a continuous representation. By comparison, 
with GANs and RNNs, the optimization of prop- 
erties can be achieved by biasing the generation 
process, typically with RL by rewarding or penal- 
izing generative behaviors. 

VAEs (48) give control over the data gen- 
eration via latent variables. An autoencoder (AE) 
model includes an encoding and a decoding 
network. The encoder maps the molecule to a 
vector in a lower-dimensional space known as 
the latent space, and the decoder maps the latent 
vector back to the original representation. The 
encoder acts as a compression and the decoder 
as a decompression operation. The AE is trained 
to process and reproduce the original datum. In 
the act of distilling and expanding information, 
the AE is expected to learn some of the essential 
features of the data. The AE is sufficient to re- 
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produce the training data, but it can easily learn 
to memorize the data. To be able to extrapolate 
and sample new molecules, we must fill the 
uncovered spaces of the latent space. The VAE 
achieves better generalizability by constraining 
the encoding network to generate latent vectors 
following a probability distribution on the latent 
space; often the distribution is Gaussian, owing 
to its accessible numerical and theoretical prop- 
erties. Therefore, a molecule is represented not 
as a fixed point but as a probability distribu- 
tion over latent space. In practice, this is done as 
a sampling procedure; when training, noise is 
added to the latent vector, so the VAE must re- 
construct the same molecule from a noisy vector. 

Arguably, the most interesting part of a VAE is 
the latent space. Molecules are represented as 
continuous and differentiable vectors residing 
on a probabilistic manifold. Latent space encodes 
a geometry; for a given molecule, we can sample 
nearby to decode similar molecules, and with 
increasing distance, we decode increasingly dis- 
similar molecules. 

Given two molecules, we can trace a path 
between their corresponding latent coordinates, 
interpolate among the path, and decode inter- 
polated molecules. Initially, VAEs were proposed 
for encoding characters of SMILES and then 
extended to take into account grammar and 
syntax features, which improve the generation 
of syntactically valid structures (70, 71). 

Latent space allows for direct gradient-based 
optimization of properties, as latent space is a 
continuous vector space. Nevertheless, the man- 
ifold of molecules has many local minima. One 
approach has been to explore a smoothed ver- 
sion of the manifold via Bayesian optimization 
(71) or constrained optimization with Gaussian 
processes (47). 

By jointly training the VAE to reproduce mol- 
ecules and properties, in a semi-supervised fash- 
ion, the latent molecular space reorganizes itself 
so that molecules with similar properties are 
close to each other. For a given property, there 
will exist preferred dimensions and directions. 
By changing the quality of their Gaussian pro- 
cesses, Gomez-Bombarelli et al. (47) demonstrated 
the capability of local or global optimization 
across the generated distribution. 

Another way of building a generative model is 
with adversarial training under the GAN frame- 
work. Here, the generator competes against a 
discriminative model; specifically, the generator 
tries to generate synthetic data from sampling a 
noise space, whereas the discriminator tries to 
distinguish data as synthetic or real. Both models 
train in alternation, with the goal of the gen- 
erator learning to structure noise toward produc- 
ing data that the discriminator cannot classify 
better than chance. Convergence for GANs is not 
straightforward and can suffer from several 
issues, including mode collapse and overwhelm- 
ing of the generator by the discriminator during 
training. Improving training for GANs is a cur- 
rent research topic (72), and dealing with dis- 
crete data, which suffers from nondifferentiability, 
has some workarounds (73, 74). 
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To bias the generation process with GANs 
and RNNs, a gradient is needed to guide the 
optimization of a network toward desired prop- 
erties. These properties could be modeled via 
neural networks and backpropagated to the 
generator, as is the case with GANs where the 
optimization metric is that the output looks 
like real data. To incorporate properties from 
chemoinformatic tools, simulations, or experi- 
mental measures, we need to create a gradient 
estimator that can backpropagate the generator. 

The field of RL provides several approaches 
to this problem; among the most prominent are 
Q-learning (49) and policy gradients (75). RL 
considers the generator as an agent that must 
learn how to take actions (add characters) within 
an environment or task (SMILES generation) to 
maximize some notion of reward (properties). 
With SMILES, assigning rewards can only be 
done once the sequence is completed. To over- 
come this problem, Monte Carlo Tree Search 
(MCTS) is often used as it constructs a tree of 
probabilities and weights, simulating several 
possible completions for sequences, evaluating 
their reward, and weighting paths through the 
tree based on their success or failure at the given 
task. The completion behavior (policy) is learned 
as a neural network. 

Because of these features, several molecular 
applications have adopted RL and MCTS for 
generation of drug-like molecules (76-78) and 
reaction synthesis planning (79, 80). 

The aforementioned approaches are not ex- 
clusive; they can be mixed to yield advantages 
from each. For instance, druGAN (87) adopts 
an adversarial autoencoder network, and 
ORGANIC (82, 83) adopts both adversarial 
and RL approaches. 

It should be noted that most results of gen- 
erative models have been used in a pharmaceu- 
tical context, optimizing properties relevant to 
potential drugs such as solubility in water, melt- 
ing temperature, synthesizability, and presence 
or absence of certain substructures. For example, 
Popova et al. (78) optimized molecules for puta- 
tive inhibitors of Janus protein kinase 2 (JAK2), 
and Olivecrona et al. (77) optimized molecules ac- 
tive against the target dopamine receptor type 2. 

Part of the focus on the SMILES representa- 
tion has been driven by the adoption of natural 
language processing deep learning tools. It 
should be noted that SMILES represents only 
a subset of possible molecules; for example, a 
syntactically invalid SMILES string might still 
be a valid molecular structure, but its physics 
is not encoded by basic valence rules as used 
in SMILES. The introduction of more molec- 
ular representations and easy-to-use molecular 
property predictors will expand the use of gen- 
erative models in other molecular contexts. 

Looking ahead, new theoretical developments 
in ML are opening the door for generative models 
dealing with graphs. The VAE framework has 
been extended to molecular graphs (84), and 
message passing networks are used to incremen- 
tally build graphs (57). Even so, there many 
challenges remain; it is not yet clear how one 
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Fig. 4. Schematic representation of several architectures found in 
generative models. RNNs are used for sequence generation. The VAE 
shows the semi-Supervised variant, jointly trained by molecules (x) and 
properties (y). Latent space is denoted with Z, and latent vectors with z. 
In the GAN setting, the noise eventually acquires structure via the 


can deal practically with approximation methods 
for the graph isomorphism problem. 

Additionally, improved sequence generation 
models are possible with the ability to read and 
write to memory (69). These approaches demon- 
strate better ability for learning long- and short- 
term patterns. More work is needed on Riemannian 
optimization methods that exploit the geometry 
of latent space. Structured architectures such as 
multilevel VAE (85) offer new ways of organizing 
latent space and are promising research direc- 
tions. New approaches also lie in inverse RL, 
geared toward learning a reward or loss function 
(86). Research in this direction will allow for the 
discovery of reward functions associated with 
different materials discovery tasks. 


Outlook 


Inverse design is an important component of the 
complex framework required to design matter at 
an accelerated pace. The tools for inverse design, 
especially those stemming from the field of ma- 
chine learning, have shown rapid progress in 
the last several years and have allowed chemical 
space to be framed into probabilistic data-driven 
models. Generative models produce large numbers 
of candidate molecules, and the physical realiza- 
tions of these candidates will require automated 
high-throughput efforts to validate the genera- 
tive approach. The community has yet has to 
show more than a few examples of successful 
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closed-loop approaches for the design of matter 
(87). The blurring of the barriers between theory 
and experiment will lead to Al-enabled auto- 
mated laboratories (88, 89). 

The combination of inverse design tools with 
active learning approaches such as Bayesian 
optimization (90, 91) can enable a model that 
adapts as it explores chemical space, which 
allows for expanding a model in regions of 
high uncertainty and enabling the discovery 
of regions of molecular space with desirable 
properties as a function of composition. Active 
learning in the space of objective functions could 
lead to a better understanding of the best rewards 
to seek while carrying out machine learning. 

As seen, central to machine learning meth- 
odologies is the representation of molecules; 
representations that encode the relevant physics 
will tend to generalize better. Despite consider- 
able progress, much work remains. Graph and 
hierarchical representations of molecules are an 
area requiring further study. 

The integration of machine learning as a new 
pillar of knowledge in the curricula of chemical, 
biochemical, medicinal, and materials sciences 
will allow for a more rapid adoption of the meth- 
odologies summarized in this work. 
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REVIEW 


Fundamentals of numerical relativity 
for gravitational wave sources 


Bernd Briigmann* 


Einstein’s theory of general relativity affords an enormously successful description 
of gravity. The theory encodes the gravitational interaction in the metric, a tensor 
field on spacetime that satisfies partial differential equations known as the 
Einstein equations. This review introduces some of the fundamental concepts 

of numerical relativity—solving the Einstein equations on the computer—in simple 
terms. As a primary example, we consider the solution of the general relativistic 
two-body problem, which features prominently in the new field of gravitational 


wave astronomy. 


he basic equations of general relativity are 

the Einstein equations, first published in 

1915 (1). However, even today there are 

large gaps in our understanding of the 

physics implied by the Einstein equations. 
Stated in general terms, a major goal of research 
in general relativity is to solve the Einstein equa- 
tions for the physical situations of interest. 
Fundamental analytic solutions of the Einstein 
equations include the flat Minkowski spacetime 
known from special relativity, the Schwarzschild 
and Kerr spacetimes describing single black holes, 
and the simple Big Bang cosmologies. Also pre- 
dicted by general relativity are gravitational waves, 
which for weak fields can be obtained as analytic 
solutions of the linearized Einstein equations. 
However, the few known analytic solutions de- 
scribe only very special situations, and approx- 
imation methods fail in the regime where the 
nonlinear, strong-field effects of relativity play 
a crucial role. If we are interested in the truly 
relativistic regime, we must turn to computer 
simulations to obtain numerical solutions to 
the full Einstein equations. 

Solving the full Einstein equations on the 
computer is the subject of numerical relativity, 
which could also be called computational gen- 
eral relativity. Computers also play a role in 
algebraic computations and in approximation 
schemes, and such calculations are important 
topics in numerical relativity. But the distinguish- 
ing feature of numerical relativity is that, in 
principle, the Einstein equations in full generality 
can and must be solved numerically. 

Numerical relativity spans a large range of dif- 
ferent topics including mathematical general 
relativity, astrophysics, numerical methods for 
partial differential equations, computer prog- 
ramming, and simulation science. Current re- 
search in numerical relativity is in a transition 
from a self-contained topic in theoretical physics 
to a physical theory with numerous connec- 
tions to observational astronomy (2, 3). Gravita- 
tional wave astronomy holds much promise for 
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the future, as recognized by the 2017 Nobel Prize 
in Physics, and numerical relativity is provid- 
ing key theoretical predictions and analysis 
tools for the ongoing gravitational wave ob- 
servations (4). 


The general relativistic 
two-body problem 


As a primary application of numerical relativity, 
we consider the gravitational two-body problem. 
The two-body problem in Newtonian gravita- 
tional physics can be formulated for two point 
masses moving in their mutual gravitational field. 
A particular solution of the Newtonian two-body 
problem is a Keplerian elliptical orbit. However, 
in Einsteinian gravity, such orbital motion gen- 
erates gravitational waves that carry away energy 
and momentum. Binary orbits therefore decay, 
and the motion of the two bodies follows an 
inward spiral that eventually terminates with 
the collision and merger of the two objects. In 
most astrophysical situations, the energy loss 
due to the emission of gravitational waves is so 
small that a binary orbit decays only on time 
scales of millions or billions of years. However, 
for compact objects such as neutron stars or 
black holes in very tight binaries, general rel- 
ativistic effects such as gravitational wave emis- 
sion play a major role (5). 

Research in this field seeks to provide a theo- 
retical framework for the physics of binary black 
holes, neutron stars, and gravitational waves. 
Such an endeavor must rely on numerical sim- 
ulations in general relativity and general relativ- 
istic hydrodynamics. But a reasonably complete 
framework still requires substantial progress in 
numerical relativity and related fields. Currently 
there are serious limitations in our ability to 
model the entire range of relevant physics, from 
the nuclear physics of neutron star matter to the 
large-scale, strong-gravity effects encountered in 
binary neutron star mergers (6). The different 
dynamical phases of the binary evolution—known 
as the inspiral, the merger, and the evolution of 
the remnant—are accompanied by characteristic 
gravitational wave signatures (Fig. 1). For binaries 
involving at least one neutron star, depending on 
the specifics of the system, key features include 


the disruption of the star(s) before merger, the for- 
mation of a hypermassive neutron star, the prompt 
or delayed collapse to a black hole, the dynamics 
of the accretion torus plus the central merged 
object, and the creation of unbound material, 
the ejecta. Before discussing simulations of these 
systems, we introduce the mathematical founda- 
tion of numerical relativity. 


Mathematical foundation 


Combining space and time into spacetime can 
be considered a triumph of human thought, al- 
lowing us to perceive the true nature of relativ- 
istic and gravitational physics (7). However, this 
does not mean that we cannot or need not 
consider space and time separately. Somewhat 
ironically, after working hard to unify space 
and time, the mathematical setup of numerical 
relativity starts by splitting spacetime again 
into space and time and by making gauge 
(coordinate) choices (8) in order to reformulate 
the Einstein equations as a well-posed mathe- 
matical problem. 

General relativity is the theory of a metric 
tensor on a four-dimensional manifold, plus 
matter described by additional tensor fields. 
A manifold M endowed with a metric gz, is 
called a spacetime (M, g,,). The metric mea- 
sures lengths, here in four dimensions. The 
infinitesimal line element 


3 
ds? = gqpda"dx’ =~ gqrdadx? (1) 
a,b=0 


provides a generalization of the Pythagorean 
theorem. Repeated indices are summed over, 
following the Einstein summation convention. 
The metric is symmetric (2,5 = 85a), has a Lorentz 
signature of - + + +, and there exists an inverse 
metric g”” defined by g@g;, = 54, where &¢ is 
the identity matrix. A special example is the 
Minkowski line element ds? = -c’dt? + dx? + 
dy’ + dz’, where c is the speed of light, ¢ is the 
time coordinate, and 2, y, and z are spatial co- 
ordinates. The components of the Minkowski 
metric are constants, but in general gy, is a field 
with nonconstant components. 

The field equations of general relativity are 
the Einstein equations, 


Gav(g, Og, 07g) = 8nTw(g, ®) (2) 


where Gz is the Einstein tensor, which depends 
on the metric and its first and second derivatives, 
and 7, is the stress-energy tensor constructed 
from the matter fields ® and in general also from 
the metric. For numerical implementations, the 
first step is to write the Einstein equations as a 
well-posed system of partial differential equations 
(PDEs) for the metric. Equation 2 represents 10 
coupled, nonlinear PDEs for the 10 independent 
components of the metric, but without further 
adjustments these equations are in no known 
sense hyperbolic (i.e., well-posed as an initial 
value problem). 
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The differential operator acting on the metric 
in the Einstein equations is given by the Ricci 
tensor, 


1 Bit 
Rw = — 58" (OcOaae + OaOpSca 


— 0,0a80a — OcOvBaa) + Rin(g,0Z) (3) 


The first term by itself, ¢°“0.0a2,», which is 
often denoted °ag,, (where is the d’Alembert 
operator), has the form of the principal part of 
a simple hyperbolic wave equation, but note 
that the metric appears in two places: as the 
wave field g,, and as the inverse metric gt in 
the wave operator. The other second-derivative 
terms are not standard wave operators. The 
best we can say about the complete principal 
part in Eq. 3 is that it is quasi-linear in the metric; 
that is, it is linear in the highest-order derivatives 
but with coefficients that depend (nonlinearly) 
on the variable itself. The lower-order terms 
R!,(g, 0g) are quite involved as well, with ty- 
pical terms of the form g°‘g"'dgdg. Approach- 
ing the problem in this way makes it difficult 
to recognize that these equations are describ- 
ing the simple geometric concept of curvature 
and that there is a time evolution being de- 
fined. Further, their well-posedness properties 
are quite unclear. 

The so-called 3+1 decomposition—for example, 
in the form of Arnowitt, Deser, and Misner (ADM) 
(8)—assumes that the manifold (at least locally) 
allows a split into time and space, M = R x . 
Physics is then describable by time-dependent 
tensors on three-dimensional hypersurfaces ~, 
which correspond to ¢ = constant slices of M, 
resulting in a “foliation” of spacetime in terms 
of three-dimensional spaces. Geometrically, we 
obtain a normal vector 7“ to = that allows the 
decomposition of tensors in directions normal 
and tangential to the hypersurfaces. These are 
the time-like and space-like directions, respec- 
tively. For concreteness, we can assume coor- 
dinates a” = (t, x’) = (t, a, y, 2) with a time 
coordinate x° = t and spatial coordinates x’, 
where 7 = 1, 2,3 and a = 0, 1, 2, 3. 

Decomposing the Einstein equations is accom- 
plished by projecting G,, and R,,, in the directions 
parallel and orthogonal to n“. We discover that 
the differential operator Eq. 3 leads to two types of 
equations: (i) evolution equations containing 
time derivatives, and (ii) four constraint equa- 
tions that are essentially elliptic equations, 
highlighting the indeterminate type of Eq. 3. The 
constraints are the Hamiltonian constraint and 
the momentum constraints. The latter are re- 
miniscent of the Gauss law constraint of elec- 
trodynamics, where the divergence of the electric 
field gives the charge density. 

Given the evolution and constraint equations 
as PDEs, we still must choose a spatial and tem- 
poral domain with boundary conditions. For 
problems in astrophysics such as the two-body 
problem of black holes and neutron stars, we 
consider isolated systems where at large dis- 
tances gravitational fields become weak and 
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spacetime becomes asymptotically flat (in con- 
trast to typical cosmological models). Because 
gravity is universally attractive and long-range, 
it is not natural to restrict a system to a finite 
box, especially given that the goal is to compute 
waves traveling to infinity. Nonetheless, a ty- 
pical configuration for numerical simulations 
is a finite-size spatial domain (e.g., a sphere) 
with boundary conditions at some finite radius 
that implement the proper fall-off of the fields 
and an outgoing-wave boundary (9). 

Features unique to numerical relativity are 
various aspects of black hole spacetimes, in par- 
ticular the causal structure associated with black 
hole event horizons and the possibility of space- 
time singularities. This latter aspect can be 
viewed as the problem of specifying additional 
boundaries that represent black holes within the 
simulation domain. 


Building blocks of numerical relativity 


To define a particular strategy to solve the 
Einstein equations, we consider the following 
building blocks that define the anatomy of a 
numerical relativity simulation, with a focus on 
the compact binary problem. The following items 
are certainly relevant to any evolution problem 
in computational physics: initial data, evolution, 
analysis, and numerics. We must specify the ini- 
tial conditions, integrate the equations of motion 
to obtain the evolved data, and perform an 
analysis of the evolved data to extract physical 
information. The numerical treatment of each 
item may require the implementation of specific 
numerical techniques. 


Evolution 


Formulation: Choose one of many inequivalent 
formulations (i.e., choose variables and rewrite 
the Einstein equations to obtain a well-posed 
initial value problem). Choose the order of time 
and space derivatives; make structural choices 
about the gauge and the constraints. 
Reformulating the Einstein equations as a 
well-posed initial value problem has been the 
subject of much research (10, (11). To give an 
example, the result of the generalized harmonic 


1.0 
0.0 


-10 
-20 -10 


Wave Amplitude 


gauge (GHG) formulation (72) can be cast in a 
standard first-order PDE form as 


d,u! +.A™,(u)O;u" = S"(u) (4) 


Here, the state vector uw" collects all 10 com- 
ponents g,,, the 40 first derivatives 0.8a), 
and a few additional fields depending on the 
formulation. In addition, there may be variables 
for the matter fields. Equation 4 for GHG is 
strongly and even symmetric hyperbolic (72) and 
is well suited for numerical implementation. 
Another standard way to proceed is the classic 
ADM formulation that makes the geometry 
of the 3+1 decomposition in time and space 
more explicit. Basic variables are the 3-metric 
gj and the extrinsic curvature K;, which is es- 
sentially the first time derivative of the metric 
(8). The ADM equations are only weakly hyper- 
bolic and are not suitable for numerics. How- 
ever, closely related systems, the so-called BSSN 
and Z4c formulations (JO, 17), are strongly hy- 
perbolic. Most current simulations in numerical 
relativity rely on either the GHG or BSSN/ZAc 
families of formulations. 

Constraint propagation: Maintain the con- 
straints during evolution. Perform free evolutions 
and monitor the convergence of the constraints; 
use, for example, constraint damping to maintain 
the constraints explicitly. 

Analytically, the constraints propagate; that 
is, if they are satisfied initially, they remain sat- 
isfied during a well-posed evolution. Numerically, 
even small rounding errors can trigger divergence 
from the constraint-satisfying solution, which can 
lead to a catastrophic failure of the simulation. 
How the constraints are controlled is a dis- 
tinguishing feature of each formulation. A key 
ingredient in stable binary black hole evolu- 
tions (23) is the constraint-damping scheme (/4). 
The Z4c formulation improves BSSN in the way 
the Hamiltonian constraint is treated, which 
leads to improved conservation of mass for 
neutron star simulations (15). Apart from in- 
stabilities, constraint violations in 3+1 rela- 
tivity signify a problem with four-dimensional 
covariance. The 3+1 decomposition breaks 


0) 10 20 


t[ms] 


Inspiral Merger 


Remnant 


Fig. 1. Binary neutron star mergers emit gravitational waves. The waves reveal unique 
information about extreme gravity and extreme matter—information that can be unraveled with 
the help of numerical relativity. Shown is a waveform and snapshots of the neutron star matter for 
the inspiral, merger, and remnant. The amplitude of the gravitational wave is plotted versus time. The 


merger occurs at t = O. 
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covariance of the full theory by choosing a 
foliation, but the constraints ensure that four- 
dimensional covariance is maintained. 

Gauge: Choose a coordinate condition—for 
example, in terms of lapse and shift or in terms of 
gauge source functions. Construct coordinates 
that avoid physical and coordinate singularities 
and are suitable for the black hole problem. 

The main point about the gauge choice is that 
not only do we have the freedom to choose co- 
ordinates, but it is necessary to choose nontrivial 
coordinates. For example, even for the simplest 
black hole spacetimes, a foliation can fail by 
running into the physical singularity, and the 
hypersurface (or slice) may become badly dis- 
torted by slice stretching when points start fal- 
ling into the black hole. The topic of how to 
dynamically construct good coordinates that 
lead to stable evolutions, cover spacetime with 
a regular foliation, avoid coordinate singularities, 
and avoid physical singularities inside black holes 
has become its own area of interest. In that 
context, the 3+1 decomposition is about “space- 
time engineering” because we not only evolve the 
metric variables, but also build up the spacetime 
slice by slice in coordinates that are constructed 
dynamically during the evolution. The GHG for- 
mulation relies on the harmonic gauge to obtain 
hyperbolicity (12). For BSSN, the moving punc- 
ture gauge is essential to obtain long-term black 
hole evolutions, preventing slice stretching (6) 
and allowing the black hole punctures to move 
freely (17, 18). 

Boundary conditions: Specify outer boundary 
conditions appropriate for outgoing waves and 
asymptotic flatness. Specify inner boundaries for 
black holes; choose between black hole excision 
and black hole punctures. Handle coordinate 
patch boundaries. 

Although approximate boundary conditions 
are possible, for a clean treatment, strong or 
symmetric hyperbolicity is required for well- 
posedness (17). We can then specify boundary 
conditions in terms of the ingoing and outgoing 
characteristic fields. The outer boundary condi- 


tions in numerical relativity tend to be substantially 
more complicated than the Einstein equations 
themselves, because outgoing-wave boundaries 
are typically constructed by taking additional 
derivatives of the right sides of the equations 
(10). The Einstein equations share with other 
nonlinear wave equations the feature that there 
is backscattering by waves off themselves (and, 
for binary systems, also due to the gradient in the 
gravitational well). This is a fundamental prob- 
lem for boundaries at finite radius, because in 
principle we must account for all future back- 
scattering from outside the domain. Consistent 
boundaries at finite radius have only been ad- 
dressed quite recently, considering the long his- 
tory of the Einstein equations (19, 20). 


Initial data 


Formulation: Rewrite the constraints as elliptic 
equations, identifying suitable free and dependent 
variables. 

To give an indication of what the formulation 
of the constraints entails (8), consider a conformal 
rescaling of the metric, gay) = w*&,», which 
conveniently transforms the Hamiltonian con- 
straint into a scalar elliptic equation for y. We 
have the freedom to specify a conformal metric, 
ap, that is not physical because it does not solve 
the constraints, but by solving the elliptic equa- 
tion for y we find a physical solution g,,, that 
solves the constraints. The conformal transverse- 
traceless decomposition (8) is widely used for the 
full set of constraints; for neutron star initial 
data in particular, the conformal thin-sandwich 
construction (2/, 22) is used, where typically an 
additional elliptic equation is added to initialize 
the gauge condition. 

Physical content: Solve the constraints for 
data that contain multiple black holes or neutron 
stars with arbitrary mass, spin, and momentum. 

Because the constraints are nonlinear, we 
cannot simply “add up” the metric tensors of, 
for example, two Schwarzschild black holes to 
obtain binary data, although that can be a useful 
approximate initial guess. As a result, some as- 


J, 


Fig. 2. Binary neutron star evolution with spin and precession. As a result of the general 
relativistic frame-dragging effect, a binary of neutron stars with (unaligned) spin will not move within 
a fixed orbital plane. (A) The orbital motion, indicated by two different colors for the two stars. 

(B) The angular momentum. Both can show precession and nutation effects, which will also be 
visible in the gravitational wave signal. The axes indicate spatial coordinates (A) and vector 


components of the spin (B). 
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pects of the initial data construction are in- 
direct. For example, we can start with two single 
black hole solutions for particular masses, which 
will be combined to form a binary. But solving the 
constraints for the binary leads to a change in the 
individual masses of the black holes because of 
the conformal rescaling. In some cases we have 
to perform evolutions to determine whether the 
initial data were constructed appropriately for a 
particular dynamical situation. 

There is a growing variety of initial data con- 
structions for binaries that correspond to the 
variety of physical configurations. For black holes, 
there are excision-type data, where the interior of 
black holes is removed (23, 24). Alternatively, 
black hole puncture data handle the black hole 
interior with a coordinate singularity at a point 
(25), which sometimes is called automatic ex- 
cision. The thin-sandwich formulation is well 
suited for quasi-equilibrium data of black holes 
and/or neutron stars, which, for example, can 
approximate the state of a binary system during 
a quasi-circular inspiral (26). Only quite recently 
have methods been developed for neutron stars 
that generalize the quasi-equilibrium, quasi-circular 
construction to eccentric orbits (27) and to neutron 
stars with spin (28) (Fig. 2). Solving the constraints 
for electromagnetic field configurations is another 
recent topic of investigation (29). 


Analysis 


Black holes and neutron stars: Determine all 
physical parameters during evolution. Find hor- 
izons of black holes. Analyze the rich phenome- 
nology of neutron star mergers with the remnant, 
torus, jets, and ejecta. Connect to multi-messenger 
astronomy. 

In any binary simulation, a wide range of de- 
tailed information is of interest, especially when 
matter is involved. The “relativity” in general rela- 
tivity means, however, that many quantities have 
no direct physical meaning. In general, any tensor 
component (such as gy OF Zzy) is not meaningful 
by itself; we have to construct proper gauge- 
invariant quantities. For example, mass and spin 
must be carefully defined because their local 
meaning at a point is problematic. For black 
holes, special methods are required to find the 
event horizon, which is a global concept in space- 
time and therefore expensive to compute. See 
Fig. 3 for examples. Instead, black hole excision 
relies on the apparent horizon [e.g., (13)]. 

Gravitational waves: Compute gravitational 
wave emission; control numerical and systematic 
errors. Produce gravitational wave templates in a 
form that is ready to use for gravitational wave 
detectors. Treat both waveform prediction and 
waveform analysis. 

Gravitational waves are propagating varia- 
tions in the metric tensor, and the challenge is 
to separate the physical waves from various 
coordinate effects. In the weak-field limit, we 
can define gravitational waves as small perturba- 
tions around a background metric, and a first- 
order gauge-invariant formalism can be used to 
eliminate leading-order gauge effects (30). Such 
methods are applicable because we assume that 
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Time 


the detectors are located far from the source 
where an asymptotically flat background is 
available. In simulations, the numerical grids 
often include extra patches for the far zone 
[e.g., (9)], possibly at lower resolution (see below). 

A major effort in numerical relativity is di- 
rected toward obtaining accurate waveforms 
with controlled error bars for long time inter- 
vals. For the signal-to-noise ratio of current ob- 
servations, a sufficiently accurate waveform 
model may begin with a post-Newtonian approx- 
imation (assuming nonrelativistic speeds) for 
the initial inspiral, matched to 10 to 20 orbits 
up to and including the merger from numerical 
simulations of the full Einstein equations. Ini- 
tially, the goal was to filter the signal out of the 
noise by matching against theoretical wave- 
forms. However, as the quality of the signals is 
improving, the main goal of gravitational wave 
astronomy is to estimate unknown source pa- 
rameters. For example, we need detailed wave- 
form models to distinguish black hole mergers 
from neutron star mergers, determine masses 
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Fig. 3. The twisted pair of pants. (A and B) Spacetime plot 
of the event horizon of two inspiraling black holes that merge 
and ring down: equal (A) and unequal (B) masses, time t 
running up, horizontal x-y slices of the event horizon (80). 
(C) Pair of pants computed numerically in the 1990s for 
axisymmetric, head-on collisions, time t running up, horizontal 
slices in z-p coordinates (81). 


and spins, etc. The first detection of gravita- 
tional waves by Advanced LIGO (2) was ac- 
companied by a theory paper describing how 
the properties of GW150914 were deduced from 
the observational data (4). Only by combining 
data with theory was it possible to arrive at the 
interpretation of GW150914 as the signature 
of a binary black hole merger, with specific pa- 
rameters and credibility intervals. Two families 
of models were used, the EOBNR and Phenom 
families of waveforms (2) (Fig. 4). To analyze 
the data stream from the detectors, various pa- 
rametrized waveform models are being devel- 
oped for high-speed template matching (e.g., 
reduced-order surrogate models) (37). 


Numerics 


Discretization: Choose a discretization in space 
and time. Introduce adaptive mesh refinement 
(AMR) in space and time to efficiently represent 
different physical length scales. Choose coordinate 
patches and transformations to adapt coor- 
dinates to the underlying physics. 


Once a suitably hyperbolic form of the PDEs 
of general relativity has been derived, we have 
access to several standard discretizations from 
applied mathematics. The recent trend has been 
toward high-order discretizations, with different 
choices for the geometry and the matter fields. In 
vacuum or where the matter is smooth, the geo- 
metry is smooth as well. For smooth metrics, 
fourth- to eighth-order finite differencing in space 
is applied routinely, as well as pseudospectral 
methods for exponential convergence. Neutron 
star matter is represented by general relativistic 
fluids, and handling relativistic shocks becomes 
important. Several high-resolution shock-capturing 
(HRSC) fifth-order methods are available (6), 
as is work on smoothed particle hydrodynamics 
(32, 33). 

The physics of a binary involves several phy- 
sical scales. The wavelength of gravitational 
waves near merger is about 100 times the size 
of the black holes, and the simulation domain 
is typically chosen to be at least 1000 times the 
size of the black holes to accommodate several 
wave cycles. Simulations in three spatial dimen- 
sions therefore become several orders of mag- 
nitude more efficient with AMR, often of the 
Berger-Oliger type with refinements not just in 
space, but also in time. Many codes use several 
coordinate patches to transition from two (or 
more) central objects to spherical shells near 
the outer boundary. 

Scientific computing: Implement parallel al- 
gorithms for high-performance computing. Invest 
in professional software engineering for a collab- 
orative computational infrastructure. 

Numerical relativity has been very successful 
with the hybrid MPI (message passing interface) 
plus OpenMP (open multiprocessing) or a similar 
parallelization strategy. Still, a typical numerical 
relativity simulation for a binary coalescence, 
representing just a single data point in a tem- 
plate catalog, may take roughly 1 month on 1000 
to 10,000 cores of a supercomputer. The numerical 
relativity community is working on improving the 
efficiency of these methods, including spectral 
methods and improved AMR schemes, which tend 
to be a bottleneck for massive parallelism. Most 
efforts in numerical relativity are group efforts 
with a long-term investment in an evolving 
code base. These efforts include SpEC (34), 
SACRA (35), Whisky/THC (36), Pretorius (37), 
HAD (38), BAM (39), and the community code 
Einstein Toolkit (40). Some codes approximate 
general relativity but provide more advanced 
neutron star physics (32, 33). Although similar 
in some regards—after all, the same or similar 
physics is studied—the different projects vary 
greatly in the range and the specifics of the 
physics modules, the flexibility and extensibility 
of the codes, the level of software optimization, 
and the collaboration and code-sharing models. 

The main challenge common to all these pro- 
jects is that they are implementing a “moving 
target,” as formulations and basic equations 
are still changing and more physics is added 
to the simulations. Simultaneously, they must 
handle the trend in technology toward massively 
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parallel computers and heterogeneous hardware, 
which is challenging given the complex algo- 
rithms required for numerical relativity. 


Short history of binary simulations 


The first simulations of black holes in vacuum 
were attempted in 1964 (41). By the 1970s, many 
concepts of the 3+1 ADM formulation had been 
brought into numerical relativity (42), which led 
to the seminal numerical work on head-on 
(axisymmetric, 2+1-dimensional) black hole col- 
lisions and gravitational waves (43, 44). It took 
until the early 1990s (45, 46) to revisit the head- 
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on collision with improved numerics, which con- 
firmed the early results on gravitational waves 
(46). Numerical relativity in 3+1 dimensions be- 
gan in 1995 with the evolution of a Schwarzschild 
black hole on a Cartesian grid (47) and the evol- 
ution of gravitational waves (48), followed by the 
first fully 3+1-dimensional simulation of a black 
hole binary (49, 50). All the early black hole sim- 
ulations mentioned so far were numerically un- 
stable, with barely enough evolution time to start 
with two separate black holes that promptly 
merged. The first full orbit was achieved in 
2004 (51). In 2005-2006, the last missing in- 
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Fig. 4. Numerical waveform catalogs anticipated the first gravitational wave observations. Shown 
are examples for template construction for gravitational waves from binary black hole mergers. (A) Various 
numerical waveforms computed by different research groups forming an international collaboration. 

(B) Combining post-Newtonian models for the inspiral with numerical relativity. In (A) and (B), the 
amplitude of the gravitational wave is plotted versus time. The merger occurs at t = O. In (B), the numerical 
waveform is preceded by a post-Newtonian waveform to cover more orbits of the inspiral. Such 
waveforms, which were purely theoretical, became real with the first observation of gravitational waves 
in 2015 [compare to figure 1 of (2)], making it possible to interpret the first signals as the merger events of 


two black holes. 


Briigmann, Science 361, 366-371 (2018) 


27 July 2018 


gredients for long-term stable black hole evolu- 
tions were found in two different approaches, 
one based on a harmonic gauge formulation and 
excision (13) and the other based on the BSSN 
formulation and black hole punctures (17, 18, 57). 
By 2010, the robustness and flexibility of these 
methods had been established. Improvements in 
the formulations, the boundary conditions, etc., 
are still ongoing today (Z/, 12, 15). 

Neutron star simulations were pursued in 
parallel with the black hole simulations. The 
Valencia formalism of general relativistic hydro- 
dynamics (GRHD), now the primary approach, 
was developed in the 1990s (52). The first fully 
general relativistic binary neutron star simula- 
tions were published in 2000 (53), with enor- 
mous progress in many groups since then. As far 
as the geometry of general relativity matters in 
these simulations, it turns out that the methods 
established for stable black hole simulations 
carry over to neutron star simulations (gauge, 
boundaries, initial data formulation, etc.). How- 
ever, GRHD introduces its own challenge of 
relativistic shocks, and the range of different 
physics phenomena makes this a much more 
complex problem than black holes in vacuum. 


Outlook 


Numerical relativity is developing rapidly in 
several directions, and we highlight a few repre- 
sentative examples. 


High-order methods 


High-order methods to address the ever-increasing 
demand for even more accurate and detailed sim- 
ulations are a major topic of current research. 
Among the different high-order methods to solve 
partial differential equations, the discontinuous 
Galerkin (DG) method has emerged in recent 
years as a particularly successful general-purpose 
paradigm (54). It can be argued that the DG 
spectral-element method subsumes several of the 
key advantages of traditional finite-element 
and finite-volume methods. In particular, the DG 
method works with element-local stencils, which 
is a great advantage for parallelization and the 
construction of complicated grids. Furthermore, 
DG methods offer easy access to hp-adaptivity, 
where both the size of the computational ele- 
ments (or cells) and the order of the polynomial 
approximation within each element can be 
adapted to the problem. 

There are three major efforts to use DG methods 
for general relativity and/or GRHD (55-57). The 
first simulations of a single neutron star were 
achieved recently (55, 58), and simple binaries 
are a work in progress. With regard to high-order 
approximations, there is no doubt that if expo- 
nentially convergent spectral methods such as 
DG (or pseudospectral methods) are applicable, 
they will constitute a big improvement over 
finite-difference approximations, which give only 
polynomial convergence. High-order methods 
can provide breakthroughs by reaching accu- 
racies that make new physics possible (e.g., for 
magnetic field amplification due to small-scale 
turbulence) or by reducing numerical errors to 


5 of 6 


8102 ‘9g Ainr uo /Bio'bewsoualos‘eoua!0s//:dyjy wos} pepeojumoq 


PEJIU3 NOATOTOBUJIA TPYNNA "What's News" VK.COM/WSNWS 


make gravitational wave analysis possible. Viewed 
differently, we can reach a given error criterion 
with much lower computational resources, mak- 
ing simulations feasible that are otherwise too 
computationally expensive. 


Multi-physics 


The spectacular first observation of both gravita- 
tional waves (3) and electromagnetic radiation 
(59, 60) from a neutron star merger represents the 
beginning of multi-messenger astronomy includ- 
ing gravitational waves. To model such systems, 
we need to perform “multi-physics” simulations. 

Modeling electromagnetic fields in GRHD 
can be accomplished by coupling the Max- 
well equations to the GRHD equations, for 
which the prevalent approach has been ideal 
magnetohydrodynamics (IMHD). The assump- 
tion of IMHD is that the fluid has zero resistivity, 
but for the merger—and in particular for the 
fields surrounding the remnant with torus and 
ejecta—the quality of that approximation is un- 
clear. Resistive magnetohydrodynamics (RMHD) 
is expected to be important for realistic models of 
plasma instabilities and magnetic reconnection. 
Apart from unknown physics, the mathematical 
character of the RMHD equations may be prob- 
lematic (67, 62). There are only a few general rela- 
tivistic simulations with RMHD [e.g., (61, 63, 64)]. 
Developing a proper theory of resistive relativistic 
plasmas is a large project in itself (65). 

The microphysical equation of state of neu- 
tron stars remains unknown and is also a tar- 
get for numerical models and for observations. 
Investigations may include 20 or more different 
equations of state in an attempt to cover all 
sensible proposals. Even determining just one 
parameter—the existence of neutron stars with 
2.0 solar masses (66, 67)—provided a strong con- 
straint. In principle, gravitational wave observa- 
tions can do much better, gleaning information 
from the inspiral and the merger. Although in- 
spiral signals will show rather systematic long- 
time effects (68-70), one of the grand challenges 
will be to disentangle the much more messy 
merger signal (71). 

Standard merger models predict strong heat- 
ing of the neutron star matter, which is expected 
to lead to an enormous amount of neutrino emis- 
sion with luminosity on the order of 10 ergs’. 
This burst of energy plays a role in models of 
short gamma ray bursts (72) and also for the 
ejecta, which in turn affects heavy-element pro- 
duction and macro- or kilonovae (73). However, 
currently the high dimensionality of such radia- 
tive transport problems (3+1 spacetime plus 3 for 
the radiative transport) is prohibitive, leading 
to a wide array of approximations with variable 
applicability (74, 75). A coherent picture for neu- 
trino physics in binary mergers is still lacking but 
should be a part of multi-messenger astrophysics. 


Beyond current astrophysics 


Numerical relativity has a large number of ap- 
plications outside the area of compact binaries 
and gravitational waves (76, 77). Topics include 
gravitational collapse with surprising critical 
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phenomena, boson stars and other exotic matter, 
and cosmological simulations. Going beyond clas- 
sical general relativity, the field of numerical 
relativity for alternative gravity theories and 
gravity in higher dimensions is wide open. 


Conclusion 


The next decade is sure to see numerical rel- 
ativity grow in terms of computational power 
and applicability to different physical scenarios. 
The detailed theoretical models for black hole 
and neutron star binaries that are the target of 
research in numerical relativity are closely linked 
to the observation of gravitational waves. Numer- 
ical relativity, in combination with the highly 
anticipated future observations of gravitational 
waves, is expected to provide entirely new insights 
into extreme gravity and extreme matter. 
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Public engagement helps scientists tackle global challenges 


Leshner Leadership Institute fellows use public interactions for mutual learning 


By Anne Q. Hoy 


Over the coming year, scientist Merritt Turetsky will be working 
with the Royal Society of Canada to coordinate a series of public 
dialogues on challenges facing residents of Canada’s Arctic region, 
home of the world’s fastest-changing climate. 

Climbing at twice the world’s average rate, air temperatures 
throughout the Arctic region are altering ecosystems. In Canada’s 
Northwest Territories, the shifts are impacting plants, animals, and 
residents and putting pressure on food and water security, Turetsky 
noted during a training program to make scientists effective listen- 
ers and communicators with the public. The fellowship program 
is offered under the auspices of the American Association for the 
Advancement of Science’s Alan |. Leshner Leadership Institute for 
Public Engagement with Science. 

Residents of Ottawa, Toronto, Montreal, and other Canadian 
cities will be invited to explore and discuss the transformations un- 
folding in Canada’s northern reaches. Turetsky’s strategic dialogues 
will follow the Science Café or World Café format designed to bring 
together stakeholders—in this case scientists, indigenous com- 
munity members, and other segments of the public—to “provide 
a forum for sharing ideas on complex issues and gaining mutual 
understanding of the challenges ahead,” said Turetsky. “We will use 
these events as an opportunity to bring stories from the Arctic to 
southern audiences.” 

One of 15 mid-career research scientists selected as 2018-19 
AAAS Leshner Leadership Institute fellows, Turetsky is moving 
forward on one of the program's key objectives in spearheading 
public dialogues about Canada’s northwest. The Arctic region was 
selected as a focus in recognition of Canada’s 2018 presidency of 
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the Group of Seven, the organization of industrialized democracies 
of the United States, Canada, France, the United Kingdom, Germany, 
Italy, and Japan. 

The G7’'s scientific organizations committed to devoting their scien- 
tific expertise to the sustainability of Arctic communities, particularly 
those with changing ocean systems. The societies agreed to share 
research, open data-sharing platforms that build science capacity, 
and continue satellite monitoring of the Arctic region. 


The program’s roots 

Turetsky, along with others, participated in the week-long, inten- 
sive public engagement training sessions beginning on 11 June, a 
program Leshner envisioned as a pathway to perfecting “two-way 
communication between the scientific enterprise and the commu- 
nity it is intended to serve.” 

The Leshner Leadership Institute fellowship was established in 
2015 in honor of Leshner’s long-standing dedication to enhancing 
discourse between scientists and the public following his retire- 
ment as AAAS'’s CEO and executive publisher of the Science family 
of journals and his earlier career as a neuroscientist, psychologist 
practitioner, and science agency administrator. 

Leshner Leadership Institute fellows are called upon to leverage 
their scientific research and interact with the public through a se- 
ries of activities focused on generating bidirectional dialogue about 
the promise and limits of science and technology in addressing 
significant societal issues. This year’s group was selected to apply 
its expertise to food and water security. 

The fellows make up the program's third contingent since its estab- 
lishment. Like their predecessors, participants also share their training 
by mentoring other scientists, helping scientists become strategic in 
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Leshner Leadership Institute 
fellows recently gathered for 
an intensive week of public 
engagement training on food 
and water security. 


their public outreach, and working within 
their academic institutions to establish 
public engagement programs. 

“There is a tremendous need to bring 
the scientific community into closer con- 
tact with the rest of society in the form 
of arespectful, interacting dialogue,” said Leshner. “That's a learned 
skill. That is not an innate skill and therefore there is a tremendous 
need to train people in how to do it. This approach is to train a small 
cadre and then have that cadre, in turn, leverage their experience by 
sharing what they’ve learned with the rest of their community.” 

With that statement, Leshner articulated what has become one of 
the program's central pursuits as it expands its network of scientists 
and engineers who recognize that researchers have an obligation to 
interact meaningfully with the public, particularly on issues of societal 
significance and concern. Significantly, Leshner’s appreciation of 
a missing need came during a webinar presentation prerecorded to 
attract scientists to apply for the inaugural class in the fall of 2015. 
Knowing what the program needed to accomplish before it had gotten 
off the ground revealed how Leshner’s vision has become an applied 
recognition that scientists have an obligation to interact attentively 
with the public. 

The program's goals also represent a response 
to Leshner’s view of the ineffectiveness of a 
previously held proposition that scientists 
need only to educate the public about emerg- 
ing scientific issues to instill public under- 
standing and build support. That approach “is 
not working,” he said of what is known as the 
“deficit model” in a 2003 editorial published 
in Science. 

The Leshner Leadership Institute fellows 
program was developed using relevant find- 
ings from social science research and the experiences of scientists 
and other professionals working in public engagement and science 
communication. It uses a logic model to guide practitioners in 
“intentional interaction with opportunities for mutual, two-way learn- 
ing.” Scientific researchers in the inaugural 2016-17 class focused 
on climate change, a topic of continuing public controversy. The 
second, in 2017-18, pursued the equally contentious area of infectious 
disease at a time of public misconceptions of the value of vaccines 
and risks of global pandemics. 

“After this year’s group completes its work on food and water 
security, the 2019-20 fellows are slated to explore public engage- 
ment around new technologies applied to humans, including artificial 
intelligence, nanotechnology, genetic engineering, and cognitive 
science,” said Emily Cloyd, project director for the AAAS Center for 
Public Engagement with Science and Technology. 


The fellows at work 

Benjamin Preston, director of the RAND Corporation's Infrastruc- 
ture Resilience and Environmental Policy Program and among the 
first group of Leshner Leadership Institute fellows, said the training 
helped him shift his focus to the audience, its views, and the impact 
of climate on its communities and positions when discussing the sci- 
ence of climate change. 

The recitation of “overwhelming evidence of climate change and 
the significant consequences that change is having” often erodes 
an audience's view of a scientist's objectivity, Preston said. A more ef- 
fective approach is one that focuses on the trade-offs. 

“There are many voices talking about the consequences of failing 
to act on climate change, but few that acknowledge that both action 
and inaction have consequences that need to be managed,” said 
Preston. “Rather than trying to defend a scientific finding or policy 
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“We need to engage the 
public in a more open 
and honest bidirectional 
dialogue about science 
and technology.” 


Alan I. Leshner, AAAS CEO Emeritus 


position, | choose to explore the trade-offs associated with different 
courses of action for things people care about.” 

For Maria Elena Bottazzi, associate dean of the National School 
of Tropical Medicine at Baylor College of Medicine and a member of 
the second group examining infectious disease, the fellowship came 
at an auspicious moment. Shortly after the program, Bottazzi was 
bestowed the 2018 Slim Health Award, a lifetime research achieve- 
ment honor in recognition of her two decades of advancing health in 
Latin America and the Caribbean. 

The distinction swept her into a swirl of public communications 
obligations. She was called upon to deliver speeches, conduct dozens 
of press interviews, and work with young Latin American and Carib- 
bean scientists to expand their research capacity, among other com- 
mitments. “My ability to stand out and highlight efforts devoted to 
the dissemination of knowledge and the training of human resources, 
strengthening research capacities in the region, and, consequently, 
consolidating health systems for the benefit of the population has 
been directly related to the skills, tools, and practice | received during 
my year as aAAAS Leshner fellow,” Bottazzi said. 

Bottazzi and Luis Martinez, an infectious microbes research scien- 
tist at the University of Texas, El Paso, also a fellow, organized a science 
communications workshop in Houston to train 
his university’s faculty members and students, 
an activity that contributed to expanding engage- 
ment capacity across research institutions. 


Engagement in the Arctic region 

As part of the fellows’ concentration on food 
and water security, Turetsky, an ecosystem ecol- 
ogist at the University of Guelph in Ontario, has 
devoted her research to understanding the sys- 
tems of the Arctic region, which, after oceans, 
represent the world’s largest ecosystem. 

Her work examines permafrost deterioration and the rising preva- 
lence of wildfires in the Canadian northwest, events that threaten to 
increase the release of atmospheric gases including carbon dioxide 
and methane, emissions linked to climate change. More recently, she 
has focused on soil and water quality and their connection to the 
region's food and water security. 

An example of rising concerns about water security has emerged 
from the once-considered-secure resting place for some 237,000 tons 
of highly toxic arsenic trioxide dust produced by the now-abandoned 
Giant Mine, a gold-mining facility that operated in Yellowknife, capital 
of the Northwest Territories, from 1948 until 2004. It had no pollution 
controls in its first 3 years of operation. Some 16,500 pounds of toxic 
dust a day spewed out of the smelters until 1953, settling on lakes and 
atop the area’s terrain, according to a research report funded by the 
Social Sciences and Humanities Research Council of Canada. 

Eventually, the toxic debris was captured and buried in storage 
chambers and underground rocky spaces from which gold was 
once extracted. The debris stockpiles have become unstable and 
threaten Yellowknife’s groundwater supply. A remediation plan to 
freeze the stored toxic waste is being pursued after years of scien- 
tific examination. 

Turetsky’s work in both the research and public discussion of these 
issues demonstrates the range of challenges being addressed by AAAS 
Leshner Leadership Institute fellows, current and past, and the increas- 
ing need for effective engagement between scientists and the public. 

As Leshner wrote in his 2003 Science editorial, “We need to engage 
the public in a more open and honest bidirectional dialogue about 
science and technology and their products, including not only their 
benefits but also their limits, perils, and pitfalls. We need to respect 
the public's perspective and concerns even when we do not fully share 
them, and we need to develop a partnership that can respond to them.” 
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AAAS NEWS & NOTES 


AAAS S&T Policy Forum 
explores U.S. competitiveness, 
now and in the future 


The R&D ecosystem needs change to reap 
benefits from emerging innovation 


By Anne Q. Hoy 


The U.S. scientific research and development enterprise stands at a 
critical juncture that requires the convergence of basic research and 
commercial applications to more quickly deliver benefits of next- 
generation technological innovation, said Arati Prabhakar in a lecture 
at the American Association for the Advancement of Science's 
Forum on Science & Technology Policy. 

Prabhakar, a fellow at the Center for Advanced Study in the Be- 
havioral Sciences at Stanford University and former director of both 
the Defense Advanced Research Projects Agency (DARPA) and the 
National Institute of Standards and Technology, also called on the 
R&D community to leverage advances in the social sciences to meet 
challenges of the coming decades. 

Change was a theme that ran through the 43rd annual S&T Policy 
Forum on 21-22 June at AAAS headquarters in Washington, D.C., 
focused on U.S. science, innovation, and competitiveness. The forum 
featured speakers whose careers span public service, academia, and 
the private sector, including Susan Hockfield, president emerita of 
the Massachusetts Institute of Technology and chair of the AAAS 
board, and Peter Lee, corporate vice president of artificial intelli- 
gence and research at Microsoft. 

“We need new kinds of advances and new ways of working if we 
are to contribute to our nation for the next handful of decades as 
we have for the decades just passed,” said Prabhakar, whose career 
also has traversed Silicon Valley technological start-ups and venture 
capital firms. “That's what it will take if we are to renew and fulfill R&D's 
promise of a better future for our country.” 
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Arati Prabhakar of 
Stanford delivers lecture 
at AAAS Science & 
Technology Policy Forum. 


Closing the gap between high-quality, 
multidisciplinary basic research and the 
private sector’s skillful talent for commercial 
success Calls for novel approaches. Start-up 
incubators such as California's Cyclotron 
Road and The Engine at MIT give innovators time and facilities to turn 
basic research into commercial products, she said. 

The convergence of social and behavioral sciences with mathemat- 
ics, digital technology, and cryptographic methods also is producing 
technologies. Prabhakar pointed, for instance, to a research collabo- 
ration between University of Washington scientists and Enlearn, an 
educational nonprofit, that is advancing adaptive learning for individ- 
ual students by linking their responses with visual content. Likewise, 
a Stanford medical team identified end-of-life care practices able to 
curb health care costs. 

Accelerating R&D development was at the heart of Prabhakar’s 
William D. Carey Lecture, an annual address begun in 1989 to honor 
former AAAS executive officer Carey by presenting speakers who 
exemplify his leadership on public policy issues. 

On that point, Prabhakar said ethical and societal implications 
of new technologies are not to be overlooked. “We need to play our 
part in helping our society make wise choices about the fruits of our 
labor,” she said. 

Microsoft's Lee looked to R&D’s future by tracing his journey from 
disappointing his father, a physics professor, and his mother, a chem- 
istry professor, for pursuing mathematics at the University of Michigan 
and being drawn “to the crystalline beauty of some mathematical 
foundations in computer science.” 

“It was the stereotypical hard-core art, science Asian upbringing 
that you might read about in the papers,” Lee deadpanned. He went 
on to establish a technology office at DARPA and head Carnegie Mel- 
lon University’s computer science department. 

The technology sector’s growth, Lee said, is so exponential that 
it may call for a “fourth paradigm,” to expand science beyond the 
experimental, theoretical, and computational sciences to data-driven 
science. One reason: the growth of data centers able to serve, store, 
and connect to an expanding range of digital tools through networks, 
a move away from giant computers. 

Microsoft's network more than doubles each year, Lee said, noting 
that it has added more than 250 data centers, including the June 
launch of an experimental, submerged data center on the seafloor 
off the Orkney Islands at Scotland's northern point. It is powered 
through tidal forces, is constructed of recycled materials, and con- 
nects to a modest 850 servers. 

“We see these sorts of inflection points very infrequently,” said 
Lee, discussing the growth of hyperscale data computing, the emer- 
gence of large-scale machine learning, and the possible inclusion of 
a new data-driven science paradigm. “We may in fact be in a similar 
type of inflection point today in the same way that the invention of 
mass-produced movable type, the Gutenberg press, was an indus- 
trial foundation for human knowledge.” 
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How to make vinyl cations 


Popov et al., p. 381 


IN SCIENCE JOURNALS Wien eiecnadis 


EUSOCIALITY 
The benefits of being 
well fed 


n eusocial insects, the vast 

majority of individuals sacrifice 

their reproductive potential 

to support the reproductive 

queen. Although this system 
has evolved repeatedly, there is 
still much debate surrounding its 
origin. Working with seven differ- 
ent species of ants, Chandra et al. 
used a transcriptomic approach 
to show that a single gene is 
consistently up-regulated in 
queens. This gene seems to con- 
fer reproductive status through 
integration with increased nutri- 
tion. Ina clonal ant, larval signals 
disrupt this gene up-regulation, 
destabilizing the division of repro- 
ductive labor. Increasing levels of 
the associated peptide override 
these larval signals and establish 
eusociality. -SNV 


Science, this issue p. 398 


DEVICE TECHNOLOGY 
Cooler electrons 
for transistors 


The operating power of field- 
effect transistors is constrained 
in part by the minimum change 
in voltage needed to change the 
current output. This subthresh- 
old swing (SS) limit is caused by 
hotter electrons from a thermal 
electron source leaking over the 
potential of the gate electrode. 
Qiu et al. show that graphene 
can act as a Dirac source that 
creates a narrower distribu- 
tion of electron energies. When 
coupled to a carbon nanotube 
channel, the decrease in SS 
would allow the supply voltage 
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to be decreased from 0.7 to 0.5 
volts. —PDS 


Science, this issue p. 387 


MICROBIOTA 
Sticky bacteria tolerated 
as future food 


Dictyostelium discoideum 
amoebae consume bacteria until 
the supply is exhausted. Then 
the amoeba cells clump together 
into a “slug” and initiate a com- 
plex multicellular reproductive 
phase. Specialized cells within 
aggregates rid the slug of any 
extracellular bacteria. However, 
some strains of amoeba tolerate 
live, intracellular bacteria. Dinh 


et al. discovered that these car- 
rier strains bear surface lectins 
that bind Klebsiella bacteria, pro- 
mote cell entry, and prevent the 
bacteria from being immediately 
digested. These bacteria then 
provide a future food source. 
Moreover, the internalized bacte- 
ria transfer DNA into the amoeba 
nucleus, resulting in transient 
genetic transformation. —CA 
Science, this issue p. 402 


NEUROSCIENCE 
Support for the memory 
index theory 


The link between contextual 
memory representations and 
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Atranscriptomics approach 
reveals the underpinnings of 
queen versus worker ants. 


locations or routes represented 
by hippocampal place cells 
during exploration remains 
unknown. Tanaka et al. examined 
spatial firing properties of neu- 
rons in hippocampal area CA1 
on the basis of whether they had 
recently expressed the immedi- 
ate-early activity-induced gene 
c-Fos in response to a novel con- 
text. The c-Fos—positive neurons 
displayed a more on-off firing 
pattern than the c-Fos—negative 
cells during context discrimina- 
tion. In a contextual recognition 
paradigm, these results support 
the index theory of hippocampal 
function over a cognitive map- 
ping theory. —PRS 


Science, this issue p. 392 
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GUT INFLAMMATION 
Inflammation promotes 
gut polyposis 
Peutz—Jeghers Syndrome (PJS) 
causes benign polyps in the gut 
and a higher risk of several can- 
cers caused by mutations in the 
tumor suppressor gene STK11, 
which encodes liver kinase B1 
(LKB1). LKB1's role in this dis- 
ease is thought to be related to 
its tumor suppressor function. 
Now, Poffenberger et al. 
show that the T cell-specific 
heterozygous deletion of Stk1l1 
is sufficient to reproduce PJS 
symptoms in mice (see the 
Perspective by Hollstein and 
Shaw). Polyps in mice and 
humans are characterized by 
immune cell infiltration, 
enhanced STAT3 signaling, and 
increased levels of inflammatory 
cytokines such as interleukin-6 
(IL-6). Targeting STATS signal- 
ing, IL-6, or T cells ameliorated 
the polyps, suggesting potential 
therapies for this disease. —STS 
Science, this issue p. 406; 
see also p.332 


CLIMATOLOGY 
Colder Siberian winters 
in a warming world 


Winters have been more severe 
in Siberia over the past decade, 
even while loss of sea ice indi- 
cates dramatic warming of the 
Arctic. One hypothesis pro- 
posed to explain this apparent 
paradox is that Arctic sea ice 
loss forces a change in atmo- 
spheric circulation patterns 
that leads to colder Siberian 


winters. Using comprehensive 
atmospheric circulation models, 
Zhang et al. demonstrate that 
late autumn sea ice loss in the 
Barents-Kara Seas should result 
in acolder climate in Siberia 
the following winter. Their 
experiments also suggest that 
changes in the stratosphere, 
particularly shifts in and weak- 
ening of the stratospheric polar 
vortex owing to a reduction in 
sea ice, contribute substantially 
to the severity of winters in 
Siberia. —KVH 
Sci. Adv. 10.1126/ 
sciadv.aat6025 (2018). 


CANCER 
Blocking activin actively 
treats cancer 


Platinum-based chemotherapy 
is a mainstay of treatment for 
lung cancer. However, resistance 
to this therapy is common, as 
are dose-limiting side effects, 
particularly kidney toxicity. 
To search for mechanisms 
contributing to treatment resis- 
tance, Marini et al. performed a 
whole-genome RNA interference 
screen, which implicated the 
activin pathway. Inhibition of this 
pathway offered a dual benefit 
by potentiating the effects of 
platinum drugs in mouse models 
of cancer and protecting the 
animals from kidney damage. 
Thus, activin inhibitors could be 
a valuable addition to platinum 
chemotherapy, enhancing the 
efficacy of treatment while also 
allowing higher doses or longer 
periods of drug use. —YN 

Sci. Transl. Med. 10, eaat3504 (2018). 
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: Beach-filtered intake systems could decrease the 
impact of power plant draws on aquatic ecosystem 


EDUCATION 
Taking anxiety out 


of active learning 


As STEM (science, technology, 
engineering, and mathemat- 
ics) education becomes more 
centered on active learn- 

ing practices, what happens 

to students’ anxiety levels? 
Specifically, what aspects 

of evaluative active learning 
practices cause student anxiety 
to increase or decrease? After 
measuring students’ baseline 
anxiety levels, Cooper et al. 
conducted semi-structured 
interviews to explore how 
students’ anxiety levels were 
altered in an active learning 
classroom. Results show that 
the way that the active learn- 
ing activity is implemented and 


Published by AAAS 


the extent to which students 
perceive the activity to be bene- 
ficial influence its effect on their 
anxiety. The authors encourage 
instructors to consider student 
anxiety when implementing 
active learning. —MMc 
Int. J. STEM Educ. 10.1186/s40594-018- 
0123-6 (2018). 


MOLECULAR BIOLOGY 


tRNA lost in translation 
Transfer RNAs (tRNAs), the 
adaptor molecules between 
messenger RNAs (mRNAs) and 
ribosomes during translation, 
are subjected to various types 
of chemical modifications, one 
of which is N’-methyguanosine 
(m/’G). Mutations in the human 
m’G methyltransferase complex 
lead to developmental disorders 
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such as microcephalic pri- 
mordial dwarfism and Down 
syndrome. Lin et al. mapped 
the m’G tRNA methylome at 
single-nucleotide resolution 
and demonstrated its essential 
role in mouse embryonic stem 
cells. Depletion of members 

of the m’G methyltransferase 
complex resulted in increased 
ribosome pausing on, and inef- 
ficient translation of, mRNAs 
involved in the cell cycle and 
brain development, thereby 
disrupting differentiation to 
neural lineages. This study is an 
important step toward a fuller 
understanding of how defects 
in tRNA methylation cause 
neurodevelopmental disorders. 
—SYM 


Mol. Cell 10.1016/ 
j.molcel.2018.06.001 (2018). 


SCIENCE sciencemag.org 


BIOMATERIALS 
Using biology 
to remove cells 


Decellularization is used to 
obtain extracellular matrix 
(ECM) scaffolds for use in tissue 
engineering and regenerative 
medicine, but the methods for 
stripping out the cells can be 
chemically harsh and cause 
damage to the ECM components 
or their overall architecture. 
Cornelison et al. treated 
peripheral nerve tissue with 
camptothecin, which triggers 
apoptosis, causing cell death and 
degradation. The fragments and 
any remaining cytotoxic com- 
pounds were removed with only 
a mild wash using saline solution 
or deoxyribonuclease, leaving 
the nerve scaffold intact without 


substantive loss of collagen or 
glycosaminoglycan content. 
In vivo testing in rats found no 
immunogenic response and a 
lower histological score than in 
isograft negative controls after 4 
weeks. —MSL 
Acta Biomat. 10.1016/ 
j-actbio.2018.07.009 (2018). 


NEURODEGENERATION 
Alzheimer’s disease 
in a dish? 
Alzheimer’s disease is char- 
acterized by extracellular 
deposits of B-amyloid (Ap) 
and intracellular aggregation 
of hyperphosphorylated tau 
in the brain. These lead to the 
hyperactivation of glial cells 
in the brain and neuronal loss. 
However, it has been difficult 
to serially link this series of 
pathological events. Park et al. 
developed a three-dimensional 
triculture system composed 
of human-derived neurons, 
astrocytes, and microglia. The 
model demonstrated hall- 
mark features of Alzheimer’s 
disease and AB aggregation, 
phospho-tau accumulation, 
and neuroinflammation. The 
ability to see ongoing microglia 
recruitment in the microflu- 
idic platform and consequent 
neurotoxicity bodes well for the 
utility of this type of approach 
as a platform for dissect- 
ing pathomechanisms and 
preliminary testing of potential 
therapeutics. —SMH 

Nat. Neurosci. 21,941 (2018). 


CANCER IMMUNOLOGY 
Neutrophils take 


a different tack 


Antibody-dependent cellular 
cytotoxicity is an important 
process by which cancer cells 
can be targeted and killed. 
Immune cells, such as T and 
NK (natural killer) cells, express 
Fc receptors that recognize 
cancer cells coated with 
antibodies. They then release 
cytotoxic granules that result 
in apoptosis. Matlung et al. 
show that neutrophils induce 
antibody-dependent cellular 
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cytotoxicity via a completely dif- 
ferent mechanism. This process 
entails trogocytosis-induced 
lysis of tumor cells, which the 
authors term “trogoptosis.” They 
find that neutrophils endocytose 
cytoplasmic fragments of target 
cells, which induces cancer cell 
necroptosis. In addition to direct 
killing, this mechanism may 
be important for the release 
of cancer neoantigens and 
damage-associated molecular 
patterns, which further activate 
and direct the immune response 
to tumors. —STS 

Cell Rep. 23, 3946 (2018). 


MARTIAN GEOLOGY 
Ghost dunes spotted 
on Mars 


Sand dunes are imposing physi- 
cal structures, but few signs 

of them are preserved in the 
geological record because their 
grains do not stick together and 
are moved by wind. Preservation 
can occur if a dune field is 
partially buried by lava or mud, 
which then forms a layer of 

hard rock. Subsequent erosion 
of the sand leaves holes where 
the dunes once were. These fea- 
tures, termed ghost dunes, were 
only recently identified on Earth. 
Day and Catling used images 
from the Mars Reconnaissance 
Orbiter spacecraft to identify 
two locations on Mars where 
there are hundreds of ghost 
dunes. Revealing long-vanished 
dunes will enable studies of 
Mars’ ancient winds. —KTS 


J. Geophys. Res. Planets 
10.1029/2018JE005613 (2018). 


Voids in martian surface strata show 
where sand dunes once existed. 
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MOLECULAR BIOLOGY 
Phase separation and 
gene control 


Many components of eukaryotic 
transcription machinery—such 
as transcription factors and 
cofactors including BRD4, sub- 
units of the Mediator complex, 
and RNA polymerase II|—contain 
intrinsically disordered low- 
complexity domains. Now a 
conceptual framework connect- 
ing the nature and behavior of 
their interactions to their func- 
tions in transcription regulation 
is emerging (see the Perspective 
by Plys and Kingston). Chong 
et al. found that low-complexity 
domains of transcription factors 
form concentrated hubs via 
functionally relevant dynamic, 
multivalent, and sequence-spe- 
cific protein-protein interaction. 
These hubs have the potential 
to phase-separate at higher 
concentrations. Indeed, Sabari 
et al. showed that at super- 
enhancers, BRD4 and Mediator 
form liquid-like condensates that 
compartmentalize and concen- 
trate the transcription apparatus 
to maintain expression of key 
cell-identity genes. Cho et al. 
further revealed the differential 
sensitivity of Mediator and RNA 
polymerase || condensates to 
selective transcription inhibitors 
and how their dynamic interac- 
tions might initiate transcription 
elongation. —SYM 

Science, this issue p. 378, p. 379, 

p. 412; see also p. 329 


BIOTECHNOLOGY 
Transcriptome mapping 
in the 3D brain 


RNA sequencing samples the 
entire transcriptome but lacks 
anatomical information. In situ 
hybridization, on the other hand, 
can only profile a small number 
of transcripts. In situ sequenc- 
ing technologies address these 
shortcomings but face a chal- 
lenge in dense, complex tissue 
environments. Wang et al. com- 
bined an efficient sequencing 


377-B 


approach with hydrogel-tissue 
chemistry to develop a mul- 
tidisciplinary technology for 
three-dimensional (3D) intact- 
tissue RNA sequencing (see the 
Perspective by Knopfel). More 
than 1000 genes were simulta- 
neously mapped in sections of 
mouse brain at single-cell resolu- 
tion to define cell types and 
circuit states and to reveal cell 
organization principles. -SYM 
Science, this issue p. 380; 
see also p. 328 


CLIMATE 
Improving climate 
model projections 


How much will Earth's atmo- 
sphere warm if the carbon 
dioxide concentration doubles? 
Climate models give different 
answers to this question, mak- 
ing it difficult to determine how 
much carbon can be emit- 

ted while avoiding dangerous 
levels of climate change. Ina 
Perspective, Soden et al. argue 
that the radiative forcing (i.e., the 
warming) from carbon dioxide is 
much less uncertain than model 
results suggest. The uncertainty 
is reduced substantially when 
amore sophisticated method 

is used to calculate the forcing, 
but models cannot adopt this 
method because it is too com- 
putationally expensive. Ensuring 
that model parameterizations 
are vetted against the more 
accurate calculations would help 
to reduce uncertainty in this 
crucial component of climate 
modeling. —JFU 


Science, this issue p. 326 


CANCER THERAPY 
Modeling chemotherapy- 
drug combinations 


Pharmacological inhibitors 

of DNA repair pathways 

can enhance the efficacy of 
chemotherapy. Alkan et al. 
modeled cancer cell responses 
to various chemotherapies and 
tested the efficacy of combining 
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chemotherapy with DNA repair 
inhibition. In addition to the 
expected synergy between 
chemotherapy and an inhibitor 
of ATR (a DNA repair pathway 
kinase) in cultured cells and in 
vivo, there were some unex- 
pected differences in outcome 
when applying low- versus high- 
dose chemotherapy regimens. 
Thus, modeling in cells can help 
inform therapeutic development. 
—LKF 

Sci. Signal. 11, eaat0229 (2018). 


T CELL DIFFERENTIATION 
Signal strength seals fate 


T cell differentiation into effector 
and memory T cell subsets is 
influenced by T cell receptor 
(TCR) signals. Snook et al. exam- 
ined how TCR signals influence 
CD4* T cell differentiation by 
using a panel of cloned TCRs 
that recognize the same MHC 
(major histocompatibility com- 
plex) class Il-restricted epitope 
of lymphocytic choriomeningitis 
virus. Strong TCR signals were 
associated with T helper (T,,1) 
differentiation, whereas lower 
TCR signal strength corre- 
sponded with follicular helper T 
(Tfh) and memory T cell differ- 
entiation. Low CD25 expression 
by early effector T cells also 
predicted memory differentia- 
tion, although CD25 expression 
levels were not predictive of 
recall responses. Enhanced 
TCR signaling via knockdown of 
SHP-1 favored T,,1 over Tfh and 
memory T cell differentiation. 
Thus, stronger TCR signaling 
promotes terminal effector T,,1 
differentiation. —CNF 

Sci. Immunol. 3, eaas9103 (2018). 


ORGANIC CHEMISTRY 
Silicon smooths the way 
to vinyl cations 


Saturated carbon centers often 
undergo substitution reactions 
by initial cleavage of a carbon- 
halogen or carbon-oxygen 
bond, which leaves the carbon 
positively charged. Analogous 


cations have proven much 
harder to access from vinylic 
carbons that are engaged in 
double bonds. Popov et al. now 
show that silicon cations paired 
with noncoordinating anions 
can pull triflate groups off such 
vinylic carbons under ambient 
conditions (see the Perspective 
by Kennedy and Klumpp). The 
resultant vinyl cations react 
with simple alkanes through 
C-H insertion. Theoretical and 
mechanistic studies suggest 
that these reactions proceed 
through nonclassical pathways 
that bifurcate after the transition 
state. —JSY 

Science, this issue p. 381; 

see also p. 331 
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MOLECULAR BIOLOGY 


Imaging dynamic and selective 
low-complexity domain interactions 
that control gene transcription 


Shasha Chong, Claire Dugast-Darzacq, Zhe Liu, Peng Dong, Gina M. Dailey, 
Claudia Cattoglio, Alec Heckert, Sambashiva Banala, Luke Lavis, 


Xavier Darzacq, Robert Tjian* 


INTRODUCTION: DNA binding transcription 
factors (TFs) are quintessential regulators of 
eukaryotic gene expression. Early studies of 
TFs revealed their well-structured DNA binding 
domains (DBDs) and identified functionally crit- 
ical activation domains (ADs) required for tran- 
scription. It later became evident that many ADs 
contain intrinsically disordered low-complexity 
sequence domains (LCDs), but how LCDs activate 
transcription has remained unclear. Although 
it is known that transcriptional activation by 
LCDs requires selective interaction with bind- 
ing partners, it has been challenging to direct- 
ly measure selective LCD-LCD recognition in 
vivo and unravel its mechanism of action. 


RATIONALE: Traditional biochemical recon- 
stitution and genetics studies have identified 
most of the molecular players central to tran- 
scription regulation. However, the mechanism 
by which weak, dynamic protein-protein interac- 
tions drive gene activation in living cells has 
remained unknown. Advances in live-cell single- 
molecule imaging have opened a new frontier 
for studying transcription in vivo. In this study, 
we used synthetic LacO (Lac operator) arrays 
as well as endogenous GGAA microsatellite 
loci to study LCD-LCD interactions of TFs 
such as EWS/FLI, TAF15, and Sp1 in live 
cells. To probe the dynamic behavior of TF 
LCDs at target genomic loci, we have combined 


«> Dynamic transcription factor (TF) interactions 
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> Multivalent interactions 


Gave © 


Mini hubs 
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From hubs to phase separation: Activation occurs in a wide range of TF concentrations. |n 
vivo LCD-dependent transactivation occurs in hubs formed over a broad range of TF concentrations 
(100 nM to 100 uM) and time scales (<1 s to minutes). At endogenous concentrations, TF LCDs form 
transactivation hubs at native genomic loci without undergoing evident phase separation. Upon TF 
LCD overexpression, phase separation is observed at synthetic TF binding site arrays. 
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CRISPR-Cas9 genome editing, mutagenesis, 
gene activation, cell transformation assays, 
and various high-resolution imaging approaches 
including fluorescence correlation spectroscopy, 
fluorescence recovery after photobleaching, lat- 
tice light-sheet microscopy, three-dimensional 
DNA fluorescence in situ hybridization, and 
live-cell single-particle tracking. 


RESULTS: Live-cell single-molecule imaging 
revealed that TF LCDs interact to form local 
high-concentration hubs at both synthetic DNA 
arrays and endogenous genomic loci. TF LCD 
hubs stabilize DNA binding, recruit RNA poly- 
merase II (RNA Pol ID, and activate transcrip- 
tion. LCD-LCD interactions within hubs are 
highly dynamic (seconds to minutes), selective for 
binding partners, and differentially sensitive to 

disruption by hexanediols. 
These findings suggest that 
Read the full article under physiological condi- 
at http://dx.doi. tions, rapid, reversible, and 
org/10.1126/ selective multivalent LCD- 
science.aar2555 LCD interactions occur be- 
tween TFs and the RNA 
Pol II machinery to activate transcription. We 
observed formation of functional TF LCD hubs 
at a wide range of intranuclear TF concentra- 
tions. Although we detected apparent liquid- 
liquid phase separation with gross overexpression 
of LCDs, transcriptionally competent TF LCD 
hubs were observed at physiological TF levels at 
endogenous chromosomal loci in the absence of 
detectable phase separation. In addition, muta- 
genesis, gene expression, and cell transforma- 
tion assays in Ewing’s sarcoma cells revealed 
a functional link between LCD-LCD inter- 
actions, transactivation capacity, and oncogenic 
potential. 


CONCLUSION: The use of various imaging 
methods in live cells powerfully complements 
in vitro studies and provides new insights into 
the nature of LCD interactions and their role 
in gene regulation. We propose that trans- 
activation domains function by forming local 
high-concentration hubs of TFs via dynamic, 
multivalent, and specific LCD-LCD interactions. 
It also seems likely that weak, dynamic, and 
transient contacts between TFs play a role 
in disease-causing dysregulation of gene ex- 
pression (i.e., EWS/FLI1 in Ewing’s sarcoma), 
suggesting that LCD-LCD interactions may 
represent a new class of viable drug targets. 
Although we examined a small subset of TF 
LCDs, the principles uncovered regarding 
the dynamics and mechanisms driving LCD- 
LCD interactions may be applicable to other 
classes of proteins and biomolecular inter- 
actions occurring in many cell types. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: jmlim@berkeley.edu 

Cite this article as S. Chong et al., Science 361, eaar2555 
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Imaging dynamic and selective 
low-complexity domain interactions 
that control gene transcription 


Shasha Chong’”, Claire Dugast-Darzacq’*, Zhe Liu*, Peng Dong“, Gina M. Dailey’, 
Claudia Cattoglio’”, Alec Heckert’, Sambashiva Banala*, Luke Lavis*, 


Xavier Darzacq’*, Robert Tjian’”’?* 


Many eukaryotic transcription factors (TFs) contain intrinsically disordered low-complexity 
sequence domains (LCDs), but how these LCDs drive transactivation remains unclear. We used 
live-cell single-molecule imaging to reveal that TF LCDs form local high-concentration 
interaction hubs at synthetic and endogenous genomic loci. TF LCD hubs stabilize DNA binding, 
recruit RNA polymerase II (RNA Pol Il), and activate transcription. LCD-LCD interactions 
within hubs are highly dynamic, display selectivity with binding partners, and are differentially 
sensitive to disruption by hexanediols. Under physiological conditions, rapid and reversible 
LCD-LCD interactions occur between TFs and the RNA Pol II machinery without detectable 
phase separation. Our findings reveal fundamental mechanisms underpinning transcriptional 
control and suggest a framework for developing single-molecule imaging screens for drugs 
targeting gene regulatory interactions implicated in disease. 


equence-specific DNA binding transcrip- 
tion factors (TFs) are preeminent players 
in eukaryotic gene regulation. From the 
earliest studies of human TFs, it was rec- 
ognized that regulatory proteins such as 
Spl contain well-structured DNA binding do- 
mains (DBDs) and functionally critical trans- 
activation domains that participate in specific 
TE-TF interactions to direct gene transcription 
(J-3). Numerous atomic structures of DBDs have 
provided a concrete understanding of TF-DNA 
interactions. In contrast, many transactivation 
domains contain low-complexity sequence do- 
mains (LCDs) that persist in an intrinsically 
disordered conformation not amenable to con- 
ventional structural determination. Mutations 
in TF LCDs not only disrupt transcription but 
also have been implicated in cancer and neuro- 
degenerative disorders (4, 5). However, the 
mechanism by which TF LCDs execute specific 
transactivation functions has remained an enig- 
ma. Elucidation of how LCDs operate in vivo, 
given the dynamic nature of TF-TF interactions 
required for gene regulation, has been an equally 
challenging problem. 
Several in vitro studies have suggested that 
purified LCDs from the FET protein family (FUS, 
EWS, and TAF15) can undergo reversible hydrogel 
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formation or liquid-liquid phase separation at 
high concentrations and low temperatures (6-8). 
Moreover, the C-terminal domain of RNA poly- 
merase IT (RNA Pol II) is itself an LCD and can 
undergo phase separation (9) and be incorporated 
into FET LCD hydrogels in a phosphorylation- 
regulated manner (J0). FET LCDs were also re- 
ported to undergo phase separation in live cells 
upon overexpression (7, 17). 

However, stark differences exist between 
in vivo physiological conditions and those used 
for in vitro or overexpression studies. Temper- 
ature, protein concentration, purity, and micro- 
environment may all substantially affect the 
behavior of LCDs. There is also a vigorous debate 
as to whether LCDs undergo cross-8 polymer- 
ization or remain in a disordered conformation 
when interacting with partners (6-8, 11-17). 
From the perspective of elucidating how TFs 
work in vivo, an equally pressing unresolved 
mechanistic question concerns the dynamics 
and time scales governing LCD-LCD interac- 
tions that would allow TFs to function in rapid 
cellular processes. Selectivity of cognate LCD- 
LCD interactions is another important yet poorly 
understood feature that is required for proper 
TF function in vivo. Thus far, selective LCD-LCD 
recognition has not been directly demonstrated 
in vivo, let alone understood at a mechanistic 
level. In this study, we combined a variety of high- 
resolution imaging strategies—including fluo- 
rescence correlation spectroscopy (FCS) (18, 19), 
fluorescence recovery after photobleaching (FRAP) 
(20), lattice light-sheet microscopy (27), three- 
dimensional (3D) DNA fluorescence in situ hybrid- 
ization (FISH) (22), and live-cell single-particle 


tracking (SPT) (23, 24)—to probe the dynamic 
behavior of TF LCDs at target genomic loci under 
physiological conditions. 


Synthetic LacO arrays mediate 
formation of LCD interaction hubs 


We first established proof-of-concept experi- 
ments by using a synthetic Lac operator (LacO) 
array (~50,000 LacO repeats) integrated into the 
genome of human U20OS cells (25) that express 
various enhanced yellow fluorescent protein 
(EYFP)-tagged TF LCDs fused to the Lac repressor 
(Lacl) (Fig. 1A). To probe potential sequence 
specific LCD-LCD interactions, we examined two 
distinct classes of LCDs: QGYS-rich LCDs from 
the FET family and a QGTS-rich LCD from Sp1 
that is low in tyrosine (table S1) (Q, Gln; G, Gly; Y, 
Tyr; S, Ser; T, Thr). 

As expected, the LacO array recruits a large 
number of EYFP-LCD-LaclI molecules via targeted 
DNA binding, forming a concentrated local inter- 
action hub in the nucleus. LacO-associated hubs 
formed by LCD-LaclI but not Lacl are visible by 
bright-field microscopy (Fig. 1B), suggesting that 
the refractive index and mass density of LCD- 
Lacl hubs differ considerably from the surround- 
ing nuclear environment. 

We found that both TAF15-LacI and Sp1-LacI 
give rise to much brighter and larger LacO- 
associated LCD hubs than LaclI alone. To quan- 
tify this effect, we used two orthogonal methods 
to measure the protein concentrations in live cells. 
Specifically, we performed fluorescence intensity 
and FCS measurements of intracellular protein 
and compared the results with standard concen- 
tration curves of a purified fluorescent tag (EYFP 
or mCherry) calibrated by the same method (fig. 
S1). Next, we estimated the absolute protein copy 
number in each hub by using the average in-hub 
protein concentration (fig. S2A) and hub dimen- 
sions measured in single-cell images (fig. S2B). 
The two independent concentration measure- 
ments consistently showed that the LCD-LacI 
copy number in the LacO-associated hub increases 
with the TF nuclear concentration much faster 
than with LacI alone and reaches levels up to 
orders of magnitude greater than the number 
of LacO repeats available for direct LacO-Lacl 
binding (Fig. 1C and fig. S2, C and D), suggest- 
ing likely cooperative multivalent interactions 
at the LacO array that is contributed by extensive 
LCD self-interactions. Similarly, smaller LacO ar- 
rays containing substantially fewer (~15,000) LacO 
repeats similarly nucleate LCD self-interactions 
(Fig. 1D and fig. S2, E and F). 

Moreover, LCD-LaclI but not LacI alone can 
form hundreds of smaller puncta throughout 
the nucleus once its intranuclear concentration 
reaches a certain threshold (Figs. 1B and 2A, and 
fig. S2, G and H). LCDs can form intranuclear 
puncta in some cases even without being fused to 
a DBD such as Lacl (Fig. 3A, bottom). These re- 
sults suggest that LCD-LCD interactions can 
promote self-assembly of LCD hubs upon over- 
expression without assistance from DNA (7, 11). 

In addition, FRAP dynamics of LCD-LaclI at 
the LacO array was also significantly different 
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from that of LacI (Fig. 1E). Because diffusion 
contributes negligibly to the FRAP dynamics (fig. 
83, A and B), such differences can be attributed 
to changes in dissociation rates. Specifically, when 
we fit the FRAP curves with a reaction-dominant 
model (26), we found that fusing TAFI5 or FUS 
LCD to Lacl leads to more than 60% reduction in 
the dissociation rate constant of Lacl (fig. $3, C 
to F). This result suggests that at increased local 
TF concentrations, TF LCD hubs driven by LCD- 
LCD interactions stabilize TF binding to its 
cognate genomic site via multivalent contacts 
that could include both DBDs for tethering to 
chromatin and one or more LCDs within TF 
proteins that can form multiple transient inter- 
actions with different partner proteins. 
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LCD hubs interact with RNA Pol Il 

Having demonstrated homotypic LCD self- 
interactions, we next investigated the potential 
role of heterotypic LCD-LCD interactions in hub 
formation. First, we tested whether TF LCD hubs 
can interact with RNA Pol II in vivo by using a 
LacO-containing U2OS line in which we replaced 
the endogenous RPB1 (major and catalytic sub- 
unit of RNA Pol II) with an o-amanitin-resistant 
Halo-tagged RPB1 (27). We subsequently labeled 
the cells with a fluorescent HaloTag ligand and 
visualized RNA Pol II distribution in vivo. We 
found that mCherry-FET-Lacl expression mediates 
significant enrichment of RNA Pol II in LacO- 
associated hubs compared with background levels 
recorded using LacI alone (Fig. 1, F and G, and 


fig. S4, A to C). Moreover, self-assembled LCD- 
Lacl hubs that are unaffiliated with a LacO array 
also enrich RNA Pol II (fig. S4D), suggesting that 
LCD hubs interact with RNA Pol II potentially 
without assistance from DNA. While recapitulat- 
ing the in vitro incorporation of RNA Pol II into 
LCD hydrogels (10), these experiments go one step 
further and suggest that LCD hub formation can 
facilitate the recruitment of the general transcription 
machinery in vivo—a key step toward transactivation. 


LCD-LCD interactions are sequence 
specific and differentially sensitive to 
hexanediol disruption 


Next, we probed the sequence specificity of inter- 
actions between various classes of LCD. To this 
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Fig. 1. A LacO array can mediate the formation 
of an LCD hub in live cells, which involves 
extensive LCD self-interaction and recruits RNA 
Pol Il. (A) Schematic for a LacO array (n = 50,000 
repeats for array 1, n = 15,000 repeats for array 2) 
in the U2OS genome nucleating an LCD hub 
when EYFP-LCD-Lacl is transiently expressed. 
Alternatively, EYFP-Lacl is expressed as a control. 
NLS, nuclear localization signal. (B) Confocal 
fluorescence and bright-field images of 
LacO-containing U2OS cells where LacO array 1 
(highlighted by circles) is bound by EYFP-labeled 
LCD-Lacl or Lacl. LCD-Lacl—bound, but not 
Lacl-bound, LacO arrays are visible in bright-field 
images. (C and D) Copy number of EYFP-labeled 
(C) or mCherry-labeled (D) TAF15 LCD-Lacl 

(red) or Lacl (blue) molecules bound to LacO 
array 1(C) or 2 (D) as a function of mean nuclear 
concentration of the TF. Concentrations were 
measured by fluorescence intensity comparison. 
Each dot represents one cell. (E) Averaged FRAP 
curves at LacO array 1 bound by mCherry-labeled 
TAF15 LCD-Lacl (red) or Lacl (blue). Error 

bars represent SD. a.u., arbitrary units; 

N, number of cells analyzed. (F) (Top) Schematic of 
the proteins expressed in the LacO-containing 
U20S line. (Bottom) Confocal fluorescence 
images show that Halo-RPB1 (labeled with 

200 nM Halo ligand JF500, green) is enriched at 
LacO array 2 bound by mCherry-EWS LCD-Lacl 
(red). (G) (Left) Averaged Halo-RPB1 images 

at LacO array 2 bound by mCherry-labeled 

Lacl, EWS LCD-Lacl, FUS LCD-Lacl, or TAF15 
LCD-Lacl (N = 55, 69, 81, or 143). (Right) 
Fluorescence intensity of Halo-RPB1 at the LacO 
array center in the average images after 
subtraction of nuclear Halo-RPB1 background 
(see supplementary methods). ** denotes a 
statistically significant increase compared with 
the Lacl condition (P < 0.01, two-sample t test). 
Error bars represent bootstrapped SD (45). 
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end, we coexpressed both EYFP-LCD-LacI and 
mCherry-LCD in LacO-containing cells. mCherry- 
LCD lacking a DBD becomes enriched at the array 
only when it can interact with the coexpressed 
LCD that is fused to LacI (Fig. 2A and fig. S5A). 
Notably, the array can enrich mCherry-LCD over 
a wide range of expression levels. The EWS-LaclI- 
bound LacO array also enriches endogenous 
EWSRI, as detected by immunofluorescence (fig. 
S5B). Therefore, mCherry-LCD enrichment at 


the array is most likely due to specific LCD-LCD 
interactions rather than potential nonspecific 
overexpression artifacts. Using this two-color 
imaging assay, we confirmed homotypic self- 
interactions of all tested LCDs (from the FET 
family and Spl). Intriguingly, although all three 
FET LCDs interacted among themselves, none of 
them interacted with the Sp1 LCD (Fig. 2, Aand 
B), suggesting that LCD interactions exhibit 
strong sequence specificity that is likely an es- 
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Fig. 2. LCD hub formation involves selective protein-protein interactions, which can be 
disrupted by 1,6-HD with sequence-dependent sensitivity. (A) Confocal fluorescence 

images of U2OS cells containing LacO array 1 that coexpress various combinations of 
mCherry-LCD and EYFP-LCD-Lacl. The region surrounding the LacO array is zoomed in. 

B) Quantification of the enrichment of mCherry-LCD (red) at the LacO array 1 bound by various 
EYFP-labeled LCD-Lacl fusion proteins (green), calculated as the peak mCherry fluorescence 
intensity at the array divided by the average intensity immediately surrounding the array 

fig. S5A). Null, mCherry not fused to any LCD. An mCherry enrichment at the array above 

1 suggests LCD-LCD interactions. * denotes a statistically significant difference above 1 (P < 0.05, 
one-sample t test). NS, nonsignificant difference above 1. Error bars represent SE. (C) Fluorescence 
images of FUS and Sp1 LCD hubs before (0 s) and after (29 s) addition of 10% 1,6-HD. 

D) Number of nuclear puncta formed by FUS or Sp1 LCD surviving over time upon addition 

of 1,6-HD at different concentrations. Error bars represent SE. 
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sential feature underlying combinatorial TF reg- 
ulation of gene expression. 

To better understand the nature of LCD-LCD 
interactions, we treated cells with 1,6-hexanediol 
(1,6-HD), an aliphatic alcohol known to dissolve 
various intracellular membraneless compartments 
and FUS hydrogels in vitro through disruption of 
hydrophobic interactions (28-30). We observed 
that both FUS and Spl LCD hubs rapidly dis- 
assemble within 30 s when exposed to 10% 1,6- 
HD. The LacO-associated LCD-induced hub shrank 
to a size comparable to that of the array bound 
by LacI alone, whereas all nuclear puncta not 
associated with LacO disappeared (Fig. 2C and 
Movie 1). We also found that 2,5-hexanediol (2,5- 
HD), a less hydrophobic derivative of 1,6-HD that 
barely melts FUS hydrogels in vitro (28), disrupts 
LCD hubs less efficiently in live cells (fig. S5, C 
and D). This correlation between hydrophobicity 
of hexanediols and LCD hub melting suggests 
that these aliphatic alcohols may directly in- 
fluence LCD-LCD interactions by disrupting key 
hydrophobic contacts. These in vivo results also 
mirror in vitro hydrogel studies using these same 
disrupting agents (28). 

Spl LCD hubs were disrupted significantly 
faster and more extensively than FUS LCD hubs 
with 2 or 5% 1,6-HD (Fig. 2D). Thus, although a 
combination of intermolecular forces may con- 
tribute to LCD hub formation, our results in- 
dicate that hydrophobic interactions might be 
more sensitive to disruption and play a more 
dominant role in Sp1 LCD self-interactions than 
FUS LCD, consistent with the Sp1 LCD contain- 
ing hydrophobic residues sparsely interspersed 
among Q repeats (37). The differential sequence 
dependence of LCD-LCD interactions revealed 
by 1,6-HD treatment may be correlated to the 
selectivity of homo- and heterotypic LCD inter- 
actions observed above. 


LCD-LCD interactions are 
highly dynamic 


To study the dynamics of protein-protein inter- 
actions between LCD pairs, we coexpressed 


0.00 sec 


Movie 1. LacO array—associated and self- 
aggregated non-array—affiliated FUS LCD 
hubs in live cells are quickly disrupted by 
10% 1,6-hexanediol (related to Fig. 2C). 
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EYFP-LCD-LacI and Halo-LCD in the LacO- 
containing U2OS cells and performed SPT of 
Halo-LCD to measure residence times (RTs) of 
LCD-LCD interactions within the LacO-associated 
hub (Fig. 3A, top). For all LCDs tested, RTs re- 
sulting from self-interactions fell in the range of 
11 to 33 s (Fig. 3B). When EYFP-LCD and Halo- 
LCD from the FET family were coexpressed at 
high levels, they spontaneously formed hubs that 
are unaffiliated with the array and resemble in- 
tranuclear puncta (Fig. 3A, bottom). These non- 
array hubs bind Halo-LCD via homo- or heterotypic 
interactions with even shorter RTs (7 to 10 s). As 
expected, the Sp1 LCD that failed to interact with 
the FUS LCD had an RT of <1 s at the non-array 
FUS LCD hubs (Fig. 3B). The fact that RTs of 
many LCDs in self-aggregated hubs unaffiliated 
with genomic DNA are substantially shorter than 
in hubs formed at the LacO array suggests that 
TF-DNA interactions that maintain a high local 
concentration of TF LCDs contribute to stabilizing 
LCD-LCD interactions and vice versa. Together, 
these findings reveal the rapid, reversible, and 
interdependent nature of LCD-LCD and TF-DNA 
interactions as well as their propensity to form 
local high-concentration hubs that likely stabilize 
multicomponent complexes—e.g., transcription pre- 
initiation complex, a prerequisite for transactivation. 


EWS/FLI1 forms hubs at endogenous 
GGAA microsatellites 


Having unmasked the sequence specificity and 
dynamic nature of LCD-LCD interactions by using 
synthetic LacO arrays in living cells, we next 
tested LCD behavior at native GGAA micro- 
satellites (>20 GGAA repeats) in the Ewing’s 
sarcoma cell line A673 (32-35). These cancer- 
derived cells carry a chromosomal translocation 
t(11;22)(q24;q12) producing a fusion oncogene, 
EWS/FLI, that encodes a potent TF consisting 
of a trans-activating LCD from EWSRI1 and the 
DBD from FLI1 that targets GGAA sequences 
(Fig. 4A). 

To visualize the behavior of endogenously 
expressed EWS/FLII1, we fused a HaloTag to its 
DBD using CRISPR-Cas9-mediated genome edit- 
ing of A673 cells (Fig. 4, A and B, and fig. S6A) 
(36). This knock-in strategy allowed us to image 
fluorescently tagged endogenous EWS/FLII at 
its normal expression levels (Fig. 4B), which was 
essential because LCDs tend to self-aggregate 
and behave aberrantly upon overexpression. To 
ensure that Halo-tagging does not disrupt trans- 
activation functions of EWS/FLII, we confirmed 
that EWS/FLI1-Halo activates a luciferase reporter 
construct containing a GGAA microsatellite- 
driven promoter (33) as efficiently as wild-type 
(WT) EWS/FLII (fig. S6C). More importantly, 
using the gold standard neoplastic transforma- 
tion assay (37), we confirmed that the EWS/ 
FLI1-Halo knock-in A673 cells form colonies in 
soft agar much like the WT A673 cells—albeit, 
less efficiently (fig. S6, D and E). 

We next performed high-resolution lattice 
light-sheet microscopy and found that EWS/ 
FLI1 forms many small interaction hubs (>1000 
per nucleus) in the nucleus (Fig. 4C and Movie 2). 
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Fig. 3. LCD-LCD interactions involved in hub formation are highly dynamic. (A) Snapshots 

of a two-color SPT movie simultaneously imaging EYFP-labeled (green) EWS LCD-Lacl (top, 
forming a LacO-associated LCD hub) or EWS LCD (bottom, forming self-aggregated LCD hubs 

not affiliated with the LacO array) and Halo-tagged EWS LCD (2 nM PA-JF646 labeled, red) in 
U2OS cells containing LacO array 1. A white dashed contour outlines the cell nucleus. We imaged the 
hubs in the EYFP channel (green) and tracked individual Halo-EWS LCD molecules with an 
acquisition time of 500 ms in the PA-JF646 channel (red). (B) Residence times of LCD (red) bound 
at the LacO-array—associated LCD hub or at self-aggregated LCD hubs not affiliated with the 

array (green). *P < 0.05, two-sample t test. Error bars represent SE. 


The detected number of intranuclear hubs has 
the same order of magnitude as the total number 
of EWS/FLI1-bound GGAA microsatellites across 
the human genome (~6000) estimated by chro- 
matin immunoprecipitation sequencing and bio- 
informatics analyses (38). To examine the spatial 
relationship between EWS/FLI1 hubs and GGAA 
microsatellites, we performed simultaneous con- 
focal imaging of EWS/FLI1-Halo and 3D DNA 
FISH targeting genes adjacent to GGAA micro- 
satellites that are regulated by EWS/FLI1 (Fig. 
4D), including CAV1, FCGRT, ABHD6, KDSR, and 
KIAAI797 (33, 39). Although EWS/FLI1 enrich- 
ment is detected at many single loci of these genes, 
the crowded distribution of intranuclear EWS/ 
FLI1 hubs makes it difficult to clearly visualize 
EWS/FLII enrichment at single target loci. By 
recording images of ~1000 loci for each gene, 
the signal-to-noise ratio is markedly improved 
to reveal specific EWS/FLI1 enrichment at GGAA 
repeats, whereas no enrichment was seen at non- 
GGAA gene loci (Fig. 4E). These results indicate 


that EWS/FLI1 forms hubs at endogenous GGAA 
microsatellite DNA elements. 


Dynamic EWS LCD-LCD interactions 
mediate formation of EWS/FLI1 hubs 


We previously showed that because of LCD-LCD 
interactions, the LCD-LacI copy number in the 
LacO-associated hub increases much faster with 
the TF concentration than that of LacI alone, 
reaching levels that are orders of magnitude 
higher than the number of available TF bind- 
ing sites at the LacO array (Fig. 1, C and D, and 
fig. S2, C to F). Using the methods established 
earlier, we estimated the intranuclear concentra- 
tion of endogenously Halo-tagged EWS/FLI in 
A673 cells to be ~200 nM and the median copy 
number of EWS/FLI1 per hub at GGAA micro- 
satellites to be 24 (fig. S7, A and B). It was re- 
ported that most EWS/FLI1-bound microsatellites 
contain 11 to 19 GGAA motifs, with a median 
around 15 (38). Because the DBD of FLI occupies 
two consecutive GGAA repeats (33), the median 
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number of EWS/FLI1 molecules recruited to a 
microsatellite via direct DNA-protein interaction 
is estimated to be 8. The fact that there are a 
significantly greater number of EWS/FLI1 mole- 
cules in a GGAA-affiliated hub than what direct 
binding to GGAA repeats could accommodate 
suggests that, like the synthetic LacO arrays, these 
native genomic elements can also efficiently 
nucleate EWS LCD-LCD interactions to form 
local high-concentration TF hubs. Moreover, when 
transiently expressing EWS/FLI1-Halo or a Halo- 
tagged LCD deletion mutant (FLI1 DBD) in U20S 
cells, we again observe that the total number of 
EWS/FLI1 molecules in the GGAA-affiliated hubs 
increases much faster than FLI1 DBD as a function 
of TF concentration (fig. S7, C and D), providing 


Fig. 4. Combined DNA FISH and EWS/ 
FLI1-Halo imaging show that 
endogenous EWS/FLI1 forms hubs 

at GGAA microsatellites. (A) Schematic 
for GGAA microsatellites in the 

A673 genome nucleating hubs of 
endogenously Halo-tagged EWS/FLI1. 
B) Western blot of EWS/FLI1 and B-actin 
normalization control) from clonal 
EWS/FLI1-Halo knock-in (KI), WT and 
clonal EWS/FLI1 knockout (KO) A673 
ines. (C) z-projected 3D image of 
endogenous EWS/FLI1-Halo in an A673 
cell nucleus (stained with 200 nM Halo 
igand JF549) taken on the lattice 
ight-sheet microscope. (D) Confocal 
fluorescence images of 3D DNA FISH 
targeting GGAA microsatellite—adjacent 
CAVI1 gene (enhanced Cy5 labeled, 

red) and endogenous EWS/FLI1-Halo 
(JF549 labeled, green). The zoomed-in 
views depict the region surrounding 
one particular CAV1 locus. EWS/ 
FLI1-Halo enrichment at the locus 

is visible but buried in high background. 
(E) Averaged two-color images 

of five GGAA microsatellite—adjacent 
gene loci (CAV1, FCGRT, ABHD6, KDSR, 
and KIAA1797) and two gene loci not 
containing a GGAA microsatellite 
(Non-GGAA locus 1 targeting ADGRA3 
and locus 2 targeting REEP5). The 

right column shows average surface 
plots of EWS/FLI1-Halo. 
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strong evidence that LCD-LCD interactions occur 
at GGAA-affiliated EWS/FLI1 hubs. 

We previously observed that the formation of 
an LCD interaction hub slows down dissociation of 
LCD-LaclI from the LacO array (Fig. 1E and fig. 
S3F). If LCD-LCD interactions are also involved 
in EWS/FLI1 hub formation at GGAA micro- 
satellites, we expect the RT of EWS/FLI within 
GGAA-affiliated hubs to be longer than that of 
EWS/FLI1 outside hubs. We stained the EWS/ 
FLI1-Halo knock-in A673 cells with two fluores- 
cent ligands (40, 41): High-concentration JF549 
staining allows visualization of EWS/FLI1 hubs 
in the cell nucleus, whereas low-concentration 
PA-JF646 staining allows real-time tracking of 
individual EWS/FLI1 molecules (Fig. 5A and 


EWS LCD 

FLI1 DBD 
GGAA repeats on 
microsatellite DNA 


CAV1 DNA 


: 


EWS/FLI1-Halo 


Movie 3). SPT revealed the average RTs of EWS/ 
FLI1 in and outside the GGAA-affiliated hubs 
to be 90 and 16 s, respectively (Fig. 5B and fig. S8, 
A and B). The fact that EWS/FLI1 binds to GGAA 
repeats for a significantly longer time again 
suggests that EWS LCD-LCD interactions are 
involved in the formation of the GGAA-affiliated 
hubs. It is very likely that both LCD-LCD inter- 
actions and DBD binding to GGAA repeats work 
together to stabilize hub formation, much as we 
observed for LCD-LaclI at the LacO array. 

To confirm that hub formation in this native 
setting is dependent on the EWS LCD, we de- 
termined how mutations in the LCD might affect 
RTs of EWS/FLI1. We started by replacing dif- 
ferent numbers (7 = 3, 7, 10, 17, or 29) of tyrosines 
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(Y) in the EWS LCD (residues 47 to 266 of EWSR1) 
with serines (S) and then testing the self- 
interaction capability of mutant LCDs [EWS 
(YS,,)] using the LacO array assay established 
earlier. As previously shown, when we coex- 


pressed EYFP-EWS-LacI and mCherry-EWS, 
the mCherry signal became enriched at the 
LacO array owing to EWS LCD self-interaction. 
Notably, when we replaced WT EWS in both 
fusion proteins with EWS(YS,,,), mCherry enrich- 


ment at the array progressively decreased with 
an increasing number of Y-to-S mutations (fig. 
S9A) and vanished for EWS(YS._9), in which all 
tyrosines are replaced (Fig. 5C). Similarly, we 
found that EWS(YS.,) does not interact with 
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Fig. 5. Dynamic LCD-LCD interactions occur at GGAA microsatellites, 
which stabilize EWS/FLI1 binding and drive its transactivation function. 

(A) Snapshots of an SPT movie imaging endogenous EWS/FLI1-Halo labeled with 
two Halo ligands, JF549 (200 nM) and PA-JF646 (20 nM). We imaged the 
EWS/FLI1-Halo hubs in the JF549 channel (green) and tracked individual EWS/ 
FLI1-Halo molecules in and outside the hubs in the PAJF646 channel (red). 

(B) Residence times of EWS/FLII bound in hubs are longer than its residence 
times outside hubs, as determined by SPT (**P < 0.01, two-sample t test). Error 
bars represent SE. (©) EWS LCD is enriched at LacO array 1 bound by EWS 
LCD-Lacl, but EWS(YSz29) LCD is not recruited to the array by EWS(YS29) LCD- 
Lacl. However, EWS(YF29) LCD is recruited to the array by EWS(YF29) LCD-Lacl. 
(D) (Top) Schematic of proteins transiently expressed in EWS/FLI1 KO A673 cells: 
Halo-tagged EWS/FLI1, EWS(YS)/FLIL, or FLIT DBD. (Bottom) Residence times of 
EWS/FLI1 and its variants binding in and outside their hubs, as determined by SPT. 
*P < 0.05, two-sample t test. Error bars represent SE. (E) Snapshots of an SPT 


Chong et al., Science 361, eaar2555 (2018) 27 July 2018 


movie simultaneously imaging SNAPrtagged EWS/FLII (200 nM JF549 labeled, 
green) and Halo-tagged EWS or EWS(YS) LCD (20 nM PA-JF646 labeled, red) in 
EWS/FLIL KO A673 cells. Individual LCD-Halo molecules were tracked with the 
strategy described in (A). (F) Residence times of EWS bound at EWS/FLII hubs 
are longer than for EWS(YS) LCD, as determined by SPT (*P < 0.05, two-sample 
t test). Error bars represent SE. (G) Luciferase assay shows that EWS/FLI1 but 
not EWS(YS)/FLI1 or FLIL DBD transactivates a GGAA microsatellite—driven 
reporter (**P < 0.01, two-sample t test). Error bars represent SE. (H) RT-qPCR 
shows down-regulation of GGAA microsatellite—associated EWS/FLI1 target 
genes in A673 cells upon EWS/FLI1 KO. Stable expression of exogeneous (Exo) 
EWS/FLIL, but not of the mutant EWS(YS)/FLIL, rescues the expression defect in 
EWS/FLI1 KO A673 cells. For each target gene, the mRNA level was normalized 
using five different invariant genes (fig. SIOA) and graphed as a fold change 
relative to the MRNA level present in the WT A673 line (set to 1). *P < 0.05, 
two-sample t test. NS, not statistically significant. Error bars represent SD. 
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WT EWS (fig. S9, B and C). By contrast, a mutant 
replacing all 29 tyrosines with phenylalanine (F) 
[EWS(YF59)] retains hub formation activity with 
itself and with WT EWS (Fig. 5C and fig. S9D), 
suggesting that aromatic amino acids and hydro- 
phobic contacts represent major drivers of EWS 
LCD-LCD interactions. The conformational dis- 
order, proline residues, and acidic amino acids of 
the EWS LCD might play a role in keeping these 
key hydrophobic residues exposed to solvent 
and potential binding partners without being 
sequestered in a collapsed state (42). 

Next, we probed the effects of mutations that 
disrupt LCD hub formation on RTs of EWS/FLI. 
To examine behaviors of EWS/FLI1 variants in 
A673 cells without interference of endogenous 
EWS/FLII, we generated an EWS/FLI1 knockout 
A673 line using CRISPR-Cas9-mediated genome 
editing (Fig. 4B and fig. S6B) and verified that 
transiently and moderately reexpressed EWS/ 
FLII-Halo in the knockout line exhibited binding 
dynamics comparable to that of endogenous 
EWS/FLI1-Halo (Fig. 5, B and D, and fig. S8C). 
We then transiently expressed similar levels 
of FLI1 DBD-Halo or a Halo-tagged 37-residue 
Y-to-S mutant [EWS(YS)/FLI1]. Both mutants 
still displayed some hubs in the nucleus, but the 
hubs are considerably diminished. SPT revealed 
that the in-hub RTs of the mutants become 
significantly reduced (by 51 to 65%) relative to 
those of WT EWS/FLI1, whereas their outside- 
hub RTs remain largely unchanged (Fig. 5D). 
Together, these results confirm that LCD-LCD 
interactions drive the formation of EWS hubs at 
GGAA microsatellites. 

To measure the dynamics of just the protein- 
protein interactions occurring within the EWS 
LCD hubs, we transiently expressed SNAP;-tagged 
EWS/FLII and Halo-tagged EWS in the EWS/FLI1 
knockout line and labeled both fusion proteins 
using fluorescent ligands with distinct emission 
spectra (40, 41). Whereas EWS/FLI1-SNAP; forms 
hubs at GGAA microsatellites via protein-DNA 
binding, EWS LCD-Halo, which does not interact 
with DNA, binds to EWS/FLI1 hubs only via 
protein-protein interactions. We visualized EWS/ 


Movie 2. 360° view of a 3D image of 
endogenous EWS/FLI1-Halo in an A673 cell 
nucleus (stained with 200 nM Halo 

ligand JF549) taken on the lattice 
light-sheet microscope (related to Fig. 4C). 


Chong et al., Science 361, eaar2555 (2018) 
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FLI1 hubs and simultaneously tracked individual 
EWS LCD molecules that bind to the hubs (Fig. 
5E). SPT revealed the average RT of EWS LCD in 
EWS/FLI1 hubs to be 16 s, which suggests that 
LCD-LCD interactions are highly dynamic (Fig. 
5F). As expected, the mutant EWS(YS) LCD has 
a significantly shorter RT (~7 s) at EWS/FLI1 
hubs, consistent with its diminished interaction 
with the EWS LCD. 


EWS LCD-LCD interactions are essential 
for transcription and transformation 
functions of EWS/FLI1 


Finally, we tested whether LCD-LCD interactions 
influence EWS/FLI1 functions. We found that 
whereas EWS/FLII efficiently induces gene ac- 
tivation at a GGAA microsatellite in a luciferase 
assay, the mutant EWS(YS)/FLI1 and FLI1 DBD 
do not (Fig. 5G). We further engineered the EWS/ 
FLI knockout A673 line to stably express EWS/ 
FLI1 or EWS(YS)/FLI and performed reverse 
transcription quantitative polymerase chain re- 
action (RT-qPCR) to measure the expression levels 
of GGAA microsatellite-associated EWS/FLI1 
target genes. As expected, we found that expres- 
sion of EWS/FLH, but not EWS(YS)/FLI, specif- 
ically rescues the gene expression defect in the 
knockout line, indicating that EWS LCD-LCD 
interactions are required for transactivation (Fig. 
5H and fig. S10A). Moreover, the knockout line 
stably expressing EWS/FLI1, but not EWS(YS)/ 
FLII, forms colonies in soft agar like WT A673 
cells (fig. S1IOB). This demonstrates that EWS 
LCD-LCD interactions are required for oncogenic 
transformation. Taken together with previously 
published RT-qPCR and RNA sequencing data 
in mesenchymal stem cells showing the impor- 


Movie 3. Two-color movie imaging 
endogenous EWS/FLI1-Halo in a knock-in 
A673 cell nucleus stained with two 
HaloTag ligands, JF549 (200 nM) 

and PA-JF646 (20 nM). The image acquisition 
time was 500 ms. We took time-lapse 
images with a 10-s interval in the JF549 
channel to visualize EWS/FLI1 hubs and 
imaged continuously in the PA-JF646 channel 
for SPT of individual EWS/FLI1-Halo 
molecules (related to Fig. 5A). 


tant role of EWS LCD in inducing expression of 
GGAA microsatellite-associated genes (43), our 
results suggest that the formation of EWS LCD- 
dependent hubs is essential for EWS/FLI to 
activate transcription of these target genes and 
drive oncogenic gene expression programs in 
Ewing’s sarcoma. 


Discussion 


DNA binding TFs are key regulators of eukaryotic 
gene expression. Early studies of TFs revealed 
their well-structured DBDs and identified func- 
tionally critical activation domains (ADs) required 
for transcription. It later became evident that 
many transactivation sequences contain intrin- 
sically disordered LCDs, but how they mediate 
transactivation remained unclear. Despite the 
composition of LCDs and their generally unstruc- 
tured nature, ADs must interact with specific 
binding partners to activate transcription. How- 
ever, it has been challenging if not impossible 
to directly measure selective LCD-mediated tar- 
get recognition by ADs in vivo. Two other key 
aspects of transcriptional control mechanisms 
have also remained largely uncharted: What is 
the nature of the protein-protein transactions 
that drive gene activation, and how stable or dy- 
namic are these critical interactions in living cells? 

We addressed these problems by employing 
single-molecule imaging to visualize LCD-LCD 
interactions in live cells. Our findings indicate 
that TF-TF interactions are extremely transient, 
with RTs of 5 to 20 s and rarely longer than 
1 min. These studies lead us to propose that 
transactivation domains function by forming 
transient local regions of high TF concentra- 
tions or “hubs” (sometimes also referred to as 
clusters) via dynamic, multivalent, and sequence- 
specific LCD-LCD interactions. A key future en- 
deavor will be to unlock the biochemical and 
structural basis for selective LCD-LCD inter- 
actions. Such selectivity is likely essential to 
implement the complex combinatorial logic of 
transcriptional regulation. 

Recent advances in live-cell single-molecule 
imaging have not only opened a new frontier 
for studying transcription in vivo but have also 
provided an opportunity to gain greater insight 
into the nature and behavior of LCDs and their 
propensity to remain largely as intrinsically dis- 
ordered peptide sequences. By deploying high- 
resolution single-molecule imaging of TF LCD 
interactions in live cells, our studies offer a pow- 
erful complement to pioneering in vitro studies 
that provided the first clues about LCD inter- 
actions (0). Importantly, to the extent that one 
can make comparisons between hydrogels and 
intracellular LCD hub formation, many aspects of 
FET-LCD function uncovered in vitro are borne 
out within live cells. For example, the behavior 
of LCDs upon mutations, disruption by hexane- 
diols, and interaction with RNA Pol II observed 
in vivo are generally consistent with the role of 
LCDs proposed on the basis of in vitro assays. 
In addition, single-molecule live-cell imaging re- 
vealed several new aspects of LCD-driven inter- 
actions. Most notable are the fast dynamics and 
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Fig. 6. A model for A 
functional LCD-LCD 
interactions in vivo: 

From hubs to phase 
separation. (A) Dynamic 

and sequence-specific 
LCD-LCD interactions 


Gow 


TF-LCD 
sequence- 


¥ X specificity 


“oe 


drive hub formation in 


Selective LCD interactions 


live cells. (B) LCD- 
dependent transactiva- 
tion occurs in hubs 


f ™ 


formed over a broad 
range of TF concentra- 
tions. At endogenous 
concentrations, TF LCDs 
form transactivation hubs 
at native genomic loci 
without undergoing 
evident phase separation. 
Upon TF LCD 
overexpression, 

phase separation is 
observed at synthetic TF 
binding site arrays. 


sequence specificity of LCD interactions as they 
form transient local high-concentration hubs that 
drive transactivation (Fig. 6). We were also in- 
trigued by the formation of LCD-dependent hubs 
throughout the nucleoplasm that are not asso- 
ciated with cognate genomic DNA. These LCD 
interaction-driven puncta, which display a ca- 
pacity to interact with RNA Pol II, sensitivity to 
1,6-HD, and fast dynamics (recovery times of 7 
to 10 s), may provide an opportunity in future 
studies to further probe mechanisms govern- 
ing LCD-LCD-mediated hub formation and 
transactivation. 

To analyze the behavior of LCD-LCD inter- 
actions in live cells, we exploited the advantages 
of various imaging platforms and developed two 
distinct in vivo cell-based assays: one involving 
large synthetic TF binding site (LacO) arrays and 
the other targeting endogenous GGAA micro- 
satellite regulatory elements in the human ge- 
nome. When used in combination, the two assays 
afford powerful and complementary platforms for 
probing the properties of TF LCD interactions in 
the live-cell context. The highly reiterated LacO 
binding sites can serve as a useful cell-based 
assay system capable of nucleating local high- 
concentration hubs in vivo for any protein domain 
or LCD of interest through LacO-LacI mediated 
binding. These arrays can be readily detected by 
fluorescence imaging with a high signal-to-noise 
ratio and allow flexibility in probing various inter- 
action properties of LCDs while offering a con- 
venient alternative to various in vitro assays such 
as hydrogel polymerization and droplet forma- 
tion for studying gelation and phase separation 
of intrinsically disordered proteins, two processes 
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that are likely coupled with each other under 
certain physiological settings (44). 

Our studies were not designed to address 
the structure and nature of LCD-driven phase 
separated compartments, but under certain over- 
expression conditions we detected what appears 
to be liquid-liquid phase separation (i.e., spherical 
shape and local changes in refractive index). Al- 
though we can detect apparent liquid-liquid phase 
separation with gross overexpression of LCDs, 
we did not obtain evidence for phase separation 
of TF hubs (i.e., EWS/FLI1) formed at endoge- 
nous expression levels. However, transactivation 
by TF LCD hubs is observed at endogenous TF 
levels at native chromosomal loci in the absence 
of detectable phase separation. Given the tran- 
sience of LCD-LCD interactions and our direct 
measurements of TF concentrations in the nucleus 
and within hubs, we surmise that LCD-dependent 
transactivation can occur in hubs formed over 
a broad range of TF concentrations (100 nM to 
100 uM) and time scales—from extremely brief 
specific and nonspecific LCD-LCD and TF-DNA 
binding events (0.1 to 1 s) to assembly of rel- 
atively stable hubs (minutes) driven by specific, 
multivalent interactions. Both the composition 
and diversity of LCDs in hubs and their inter- 
action specificity could influence the range of 
their operational concentrations and their po- 
tential for phase separation and/or polymer 
formation. New insights regarding the rapid 
binding dynamics and functional importance 
of TF LCDs (i.e., LCD-dependent oncogenic po- 
tential of EWS/FLI1) suggest that understand- 
ing these mechanisms may also inform our 
ability to develop strategies to modulate gene 
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expression in the context of disease. Given the 
transient interactions exhibited by TFs and the 
critical role of gene dysregulation in disease 
(i.e., EWS/FLI in Ewing’s sarcoma), our findings 
offer the potential to develop single-molecule 
imaging platforms to screen drugs targeting gene 
regulatory pathways. In particular, we imagine 
that moderate- to high-throughput screens for 
gene expression inhibitors or activators based on 
high-resolution single-cell and single-molecule 
imaging could provide a strategy to probe large 
sectors of the proteome that have resisted tradi- 
tional in vitro and cell-based screens for small 
molecules. With new classes of chemicals, natural 
products, or peptidomimetic libraries, it may 
even be possible to eventually target LCDs that 
are the key drivers of regulatory protein-protein 
interactions and hub formation involved in gene 
activation and potentially many other biomolecular 
interactions implicated in disease. Finally, al- 
though we examined only a small subset of TF 
LCDs, the fundamental principles that we have 
uncovered about the rapid dynamics and hub 
mechanisms driving LCD-LCD interactions may 
be applicable to other classes of regulatory pro- 
teins and biomolecular interactions occurring in 
a variety of cell types. 


Materials and methods summary 


The number of LacO repeats in the LacO arrays 
was determined by RT-qPCR. FCS and fluores- 
cence intensity measurement were performed on 
fluorescently labeled TFs in live cells. By com- 
paring the results with standard concentration 
curves of a purified fluorescent tag, the TF nuclear 
concentration and its copy number at hubs were 
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determined. FRAP and SPT were performed to 
measure interaction dynamics between various 
TFs and their target genomic loci and to examine 
how LCD-LCD interactions affect TF-DNA inter- 
action dynamics. SPT was also used to determine 
LCD-LCD interaction dynamics. Two-color con- 
focal fluorescence imaging was used to examine 
interactions between different classes of LCDs 
and between LCD hubs and RNA Pol II. 
CRISPR-Cas9-mediated genome editing was 
performed to label the endogenous EWS/FLI1 in 
A673 cells with a HaloTag or to knock out the 
protein, allowing fluorescence imaging or func- 
tional studies of EWS/FLI1. Luciferase and soft 
agar colony formation assays were used to verify 
the functions of EWS/FLI1-Halo. Lattice light- 
sheet microscopy was used to visualize intra- 
nuclear hubs of the endogenous EWS/FLI-Halo. 
Simultaneous confocal imaging of EWS/FLI1- 
Halo and 3D DNA FISH were performed to ex- 
amine the spatial relationship between hubs of 
endogenous EWS/FLI1 and GGAA microsatellites. 
Luciferase, RT-qPCR, and soft agar colony forma- 
tion assays were used to examine the effects of 
Y-to-S mutations on the transactivation and 
transformation functions of EWS/FLII1. Detailed 
descriptions for all materials and methods are 
provided in the supplementary materials. 
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INTRODUCTION: Mammalian genes that play 
prominent roles in healthy and diseased cellular 
states are often controlled by special DNA el- 
ements called super-enhancers (SEs). SEs are 
clusters of enhancers that are occupied by an 
unusually high density of interacting factors 
and drive higher levels of transcription than 
most typical enhancers. This high-density as- 
sembly at SEs has been shown to exhibit sharp 
transitions of formation and dissolution, form- 
ing in a single nucleation event and collapsing 
when chromatin factors or nucleation sites are 
deleted. These features led us to postulate that 
SEs are phase-separated multimolecular as- 
semblies, also known as biomolecular conden- 
sates. Phase-separated condensates, such as the 
nucleolus and other membraneless cellular 


Typical enhancer 


HH Enhancer 
O Transcription factor 
@ Coactivator 


~— Intrinsically 
disordered region 


bodies, provide a means to compartmentalize and 
concentrate biochemical reactions within cells. 


RATIONALE: SEs are formed by the binding 
of master transcription factors (TFs) at each 
component enhancer, and these recruit un- 
usually high densities of coactivators, includ- 
ing Mediator and BRD4. Mediator is a large 
(~1.2 MDa) multisubunit complex that has mul- 
tiple roles in transcription, including bridging 
interactions between TFs and RNA polymerase 
II (RNA Pol II). BRD4: facilitates the release 
of RNA Pol II molecules from the site of tran- 
scription initiation. The presence of MEDI, a 
subunit of Mediator, and BRD4 can be used 
to define SEs. We reasoned that if transcrip- 
tional condensates are formed at SEs, then 


Super-enhancer 


Phase separation of coactivators compartmentalizes and concentrates the 
transcription apparatus. Enhancers are gene regulatory elements bound by transcription 
factors that recruit coactivators and the transcription apparatus (not shown) to regulate 
gene expression. Super-enhancers are clusters of enhancers bound by master transcription 
factors that concentrate high densities of coactivators and the transcription apparatus to drive 
robust expression of genes that play prominent roles in cell identity. This is achieved by the 
phase separation of coactivators, which is driven in part by high-valency and low-affinity 


interactions of intrinsically disordered regions. 
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MEDI and BRD4 should be visualized as dis- 
crete bodies at SE elements in cell nuclei. These 
bodies should exhibit behaviors described for 
liquid-like condensates. We investigated these 
possibilities by using murine embryonic stem 
cells (mESCs), in which SEs were originally de- 
scribed. Because intrinsically disordered regions 
(IDRs) of proteins have been implicated in 
condensate formation, we postulated that the 
large IDRs present in MED1 and BRD4 might 
be involved. 


RESULTS: We found that MED1 and BRD4 
occupy discrete nuclear bodies that occur at 
SEs in mESCs. These bodies exhibit properties 
of other well-studied biomolecular conden- 

sates, including rapid re- 
covery of fluorescence after 
Read the full article photobleaching and sen- 
at http://dx.doi. sitivity to 1,6-hexanediol, 
org/10.1126/ which disrupts liquid-like 
science.aar3958 condensates. Disruption of 
MED1 and BRD4 bodies 
by 1,6-hexanediol was accompanied by a loss 
of chromatin-bound MED1 and BRD4: at SEs, 
as well as a loss of RNA Pol II at SEs and SE- 
driven genes. The IDRs of both MED1 and 
BRD4 formed phase-separated liquid droplets 
in vitro, and these droplets exhibited features 
characteristic of condensates formed by net- 
works of weak protein-protein interactions. The 
MED1-IDR droplets were found to concentrate 
BRD4 and RNA Pol II from transcriptionally 
competent nuclear extracts, which may reflect 
their contribution to compartmentalizing and 
concentrating biochemical reactions associated 
with transcription at SEs in cells. 


CONCLUSION: Our results show that coacti- 
vators form phase-separated condensates at 
SEs and that SE condensates compartmentalize 
and concentrate the transcription apparatus 
at key cell-identity genes. These results have im- 
plications for the mechanisms involved in the 
control of genes in healthy and diseased cell 
states. We suggest that SE condensates facilitate 
the compartmentalization and concentration 
of transcriptional components at specific genes 
through the phase-separating properties of 
IDRs in TFs and cofactors. SE condensates 
may thus ensure robust transcription of genes 
essential to cell-identity maintenance. These 
properties may also explain why cancer cells 
acquire large SEs at driver oncogenes and why 
SEs that facilitate transcriptional dysregula- 
tion in disease can be especially sensitive to 
transcriptional inhibitors. 


The list of author affiliations is available in the full article online. 
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Super-enhancers (SEs) are clusters of enhancers that cooperatively assemble a high density 
of the transcriptional apparatus to drive robust expression of genes with prominent roles in 
cell identity. Here we demonstrate that the SE-enriched transcriptional coactivators BRD4 
and MED1 form nuclear puncta at SEs that exhibit properties of liquid-like condensates and 
are disrupted by chemicals that perturb condensates. The intrinsically disordered regions 
(IDRs) of BRD4 and MED1 can form phase-separated droplets, and MED1-IDR droplets can 
compartmentalize and concentrate the transcription apparatus from nuclear extracts. These 
results support the idea that coactivators form phase-separated condensates at SEs that 
compartmentalize and concentrate the transcription apparatus, suggest a role for coactivator 
IDRs in this process, and offer insights into mechanisms involved in the control of key 


cell-identity genes. 


hase separation of fluids is a physicochemical 
process by which molecules separate into 
a dense phase and a dilute phase. Phase- 
separated biomolecular condensates, which 
include the nucleolus, nuclear speckles, 
stress granules, and others, provide a mecha- 
nism to compartmentalize and concentrate bio- 
chemical reactions within cells (7-3). Biomolecular 
condensates produced by liquid-liquid phase 
separation allow rapid movement of compo- 
nents into and within the dense phase and ex- 
hibit properties of liquid droplets such as fusion 
and fission (4). Dynamic and cooperative multi- 
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valent interactions among molecules, such as 
those produced by certain intrinsically disordered 
regions (IDRs) of proteins, have been implicated 
in liquid-liquid phase separation (5-7). 
Enhancers are gene regulatory elements bound 
by transcription factors (TFs) and other compo- 
nents of the transcription apparatus that func- 
tion to regulate expression of cell type-specific 
genes (8-13). Super-enhancers (SEs)—clusters of 
enhancers that are occupied by exceptionally 
high densities of transcriptional machinery— 
regulate genes with especially important roles 
in cell identity (/4, 15). DNA interaction data 
show that enhancer elements in the clusters are 
in close spatial proximity with each other and 
the promoters of the genes that they regulate 
(16-18), consistent with the notion of a dense 
assembly of transcriptional machinery at these 
sites. This high-density assembly at SEs has been 
shown to exhibit sharp transitions of formation 
and dissolution, forming as the consequence of a 
single nucleation event (19, 20) and collapsing 
when concentrated factors are depleted from 
chromatin (27-25) or when nucleation sites are 
deleted (26-29). These properties of SEs led to 
the proposal that the high-density assembly of 
biomolecules at active SEs is due to phase sep- 
aration of enriched factors at these genetic el- 
ements (30). Here we provide experimental 
evidence that the transcriptional coactivators 
BRD4 and MEDI (a subunit of the Mediator 
complex) form condensates at SEs. This estab- 
lishes a new framework to account for the diverse 


properties described for these regulatory ele- 
ments and expands the known biochemical pro- 
cesses regulated by phase separation to include 
the control of cell-identity genes. 


BRD4 and MEDI coactivators form 
nuclear puncta 


The enhancer clusters that make up SEs are oc- 
cupied by master TFs and unusually high den- 
sities of factors, including BRD4 and MEDI, that 
are coactivators (31-35) whose presence can be 
used to define SEs (/4, 15, 27). We reasoned that 
if BRD4 and MED1 are components of nuclear 
condensates, then they might be visualized as 
discrete puncta in the nuclei of cells, and the 
properties of these puncta could be investigated. 
Fixed cell immunofluorescence (IF) with anti- 
bodies against BRD4 and MED1 in murine em- 
bryonic stem cells (mESCs) revealed nuclear 
puncta for both factors (Fig. 1A). To determine 
whether such puncta occur in live cells, we en- 
gineered mESCs by using CRISPR-Cas9 to tag 
endogenous BRD4 and MED1 with monomeric 
enhanced green fluorescent protein (mEGFP) 
(fig. S1). Live-cell fluorescence microscopy of the 
engineered mESC lines also revealed discrete 
nuclear puncta (Fig. 1B). Analysis of these images 
showed that there were 1034 + 130 BRD4 and 
983 + 102 MED1 puncta per nucleus (means + 
SEM) (table S1). These results demonstrate that 
BRD4 and MEDI are components of puncta 
within the nuclei of mESCs. 


SEs are associated with 
coactivator puncta 


Several lines of evidence suggest that SEs are 
likely to be associated with some of the BRD4 
and MEDI puncta in mESCs. ChIP-seq (chromatin 
immunoprecipitation followed by sequencing) 
data for BRD4 and MED1 show that SEs are 
especially enriched in these coactivators (14, 15). 
DNA interaction data suggest that SE constituents 
occupied by BRD4 and MED! are in close spatial 
proximity to one another (Fig. 1C and fig. S2A). 
Co-occupancy of the genome by BRD4 and MED1 
is most evident at SEs (fig. S2B) (14, 15). To de- 
termine whether SEs are associated with some 
of the BRD4 and MEDI puncta, we performed 
IF for BRD4 or MEDI together with DNA-FISH 
or nascent RNA-FISH for the genomic region 
containing the Nanog gene and its SEs (FISH, 
fluorescence in situ hybridization) (Fig. 1, D to G). 
We found that BRD4 and MED1 puncta con- 
sistently overlapped the DNA-FISH foci (Fig. 1D) 
or RNA-FISH foci (Fig. 1F). An average image 
analysis (details are given in the methods) of the 
BRD4 and MED1 IF signals centered at DNA- 
FISH foci (n = 137 for BRD4 and 125 for MED1) 
and RNA-FISH foci (7 = 121 for BRD4 and 181 
for MEDI) revealed that, on average, BRD4 and 
MED/1 fluorescence intensities are most enriched 
at the center of FISH foci (Fig. 1, E and G); this 
trend was not observed for average images cen- 
tered at randomly selected nuclear positions 
(Fig. 1, E and G). Radial distribution functions 
of the averaged images for FISH and IF pairs show 
a significant correlation (Spearman correlation 
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coefficients > 0.6; P values < 1 x 10°"), with 
both BRD4 and MED1 having their highest sig- 
nal intensities at the center of the FISH focus; 
signals decay with distance from this center 
(fig. S3). The radial distributions of FISH and 


A Fixed cell imaging 
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MED1 


D DNA-FISH: 
Nanog 


BRD4 


MED1 


IF at randomly selected nuclear positions are 
not correlated (Spearman correlation coefficients 
< 0.2) (fig. S3). Similar results were obtained 
when we performed IF for BRD4 or MEDI to- 
gether with nascent RNA-FISH for the SE- 
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Fig. 1. BRD4 and MED1 form puncta at super-enhancers (SEs). 


-1.5um 


regulated genes Kif#, Mir290, and Trim28 (figs. 
S3 and S4, A to F). When a similar experiment 
was conducted for two genes expressed in mESCs 
but not associated with a SE (Fam168b and Zfp606), 
there was no evident overlap between FISH foci 
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(A) Immunofluorescence (IF) imaging of BRD4 and MED1in mouseembryonic in white). The blue line highlights the nuclear periphery, determined by Hoechst 


stem cells (mMESCs). Fluorescence signal is shown alone (left) and merged staining (not shown). The rightmost column shows the area in the yellow 
with Hoechst stain (right). (B) Live imaging of endogenously tagged box in greater detail. (E) Averaged signal of (left) DNA-FISH for Nanog and 
mEGFP-BRD4 and mEGFP-MED1 in mESCs. (C) Depiction of Nanog locus, (right) IF for BRD4 or MED1 centered at Nanog DNA-FISH foci or randomly 
associated SEs (black bars), DNA contacts (red arcs), BRD4 and MED1 selected nuclear positions. (F) Colocalization between BRD4 or MED1 
ChIP-seq (green histograms), and location of FISH probes. ChIA-PET, and the nascent RNA of Nanog, determined by IF and RNA-FISH in fixed 


chromatin interaction analysis with paired-end tag; RPM, reads per million. 


mESCs. Data are shown as in (D). (G) Averaged signal of (left) RNA-FISH 


(D) Colocalization between BRD4 or MED1 and the Nanog locus by IF and for Nanog and (right) IF for BRD4 or MED1 centered at Nanog RNA-FISH 


DNA-FISH in fixed mESCs. Separate images of the indicated IF and FISH are 
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Fig. 2. BRD4 and MED1 nuclear puncta exhibit properties expected 
for biomolecular condensates. (A) Representative images of the FRAP 
experiment with mEGFP-BRD4—engineered mESCs (where " indicates time 
in seconds). The yellow box highlights the punctum undergoing targeted 
bleaching. (B) Quantification of FRAP data for mEGFP-BRD4 puncta. 

The bleaching event occurs at t = Os. For the bleached area and the 
unbleached control, background-subtracted fluorescence intensities are 
plotted relative to a prebleach time point (t = —4 s). Data are plotted as 
means + SEM (n = 9). (C) Same as (A), but with mEGFP-MED1-engineered 


Sabari et al., Science 361, eaar3958 (2018) 27 July 2018 


B 
Post-bleach 8” 
< > 1.0 
oc 
Te 
6 o 06 
We 
e 2 04 
Z o 02 = Bleached 
= §s ; Unbleached 
c= 
0 
0 5 10 1) 
t(s 
D 
= > 1.0 
in = 
= 508 
oc 
Ls 
6 o 06 
we 
E904 
s 3 02 = Bleached 
3 s Unbleached 
c* 0 
0 5 10 15 
t(s) 
F 
Post-bleach 8” 
< > 1.0 
oc 
Te 
5 @ 0.6 
wee 
E904 
2% 
= s 02 — ATPdepl bleached 
z 2 ATPdep! unbleached 
0 
0 5 10 15 
t(s 
H (s) 
Post-bleach 8” 
= =—— ATPdepl bleached 
a 20 
wu = ATPdepl unbleached 
= 5 
BS 
im [S) 
E & 
@O 
2 
c= 


mESCs. (D) Same as (B), but for mEGFP-MED1 puncta (n = 9). 

(E) Representative images of the FRAP experiment with mEGFP-BRD4— 
engineered mESCs upon ATP depletion. (F) Quantification of FRAP data 
for mEGFP-BRD4 puncta upon ATP depletion (n = 8), as in (B). 

(G) Representative images of the FRAP experiment with mEGFP-MED1-— 
engineered mESCs upon ATP depletion. (H) Quantification of FRAP data 
for mEGFP-MED1 puncta upon ATP depletion (n = 8), as in (B). Images 
were taken using the Zeiss LSM 880 confocal microscope with an Airyscan 
detector and a 63x objective at 37°C. 
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Fig. 3. 1,6-hexanediol disrupts BRD4 and MED1 puncta and disrupts 
BRD4, MED1, and RNA Pol II occupancy at SEs and SE-driven genes. 
A) Representative images of mEGFP-BRD4— or mEGP-MED1-engineered 
mESCs before and after treatment with 3% hexanediol for 15 s. 

B) Box-plot representation of the fold change in the number of 
mEGFP-BRD4 or mEGFP-MED1 puncta observed after addition of 
vehicle or 1,6-hexanediol to a final concentration of 3%. (C) Genome 
browser view of BRD4 (blue), MED1 (red), and RNA Pol II (RNAPII, 
brown) ChIP-seq data from untreated or 1,6-hexanediol-treated 

1.5% for 30 min) mESCs at the K/f4 locus. The y axis shows reads 
per million. (D) Box-plot representation of the log2 fold change in 


and BRD4 puncta (fig. S4G). These results in- 
dicate that both BRD4 and MED1 puncta are 
present at SEs. 


Coactivator puncta exhibit liquid-like 
rates of fluorescence recovery 
after photobleaching 


We next sought to examine whether BRD4 and 
MED1 puncta exhibit features characteristic of 
liquid-like condensates. A hallmark of liquid-like 
condensates is internal dynamical reorganization 
and rapid exchange kinetics (J-3), which can be 
interrogated by measuring the rate of fluores- 
cence recovery after photobleaching (FRAP). To 
study the dynamics of BRD4 and MEDI foci in 
live cells, we performed FRAP experiments on 
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(transcription start site 
per kilobase per millio 


their log2 fold change 


endogenously tagged mEGFP-BRD4: or mEGFP- 
MEDI cell lines. After photobleaching, mEGFP- 
BRD4 and mEGFP-MED1 puncta recovered 
fluorescence on a time scale of seconds (Fig. 2, 
Ato D), with apparent diffusion coefficients of 
~0.37 + 0.13 and ~0.14 + 0.04 um?/s, respectively. 
These values are similar to those previously re- 
ported for components of liquid-like condensates 
(36, 37). Adenosine triphosphate (ATP) has been 
implicated in promoting condensate fluidity 
by driving energy-dependent processes and/or 
through its intrinsic hydrotrope activity (38, 39). 
Depletion of cellular ATP by glucose depriva- 
tion and oligomycin treatment altered fluores- 
cence recovery after photobleaching for both 
mEGFP-BRD4 and mEGFP-MEDI foci; the rate of 


BRD4 (blue), MED1 (red), and RNA Pol II (brown) ChIP-seq read density 
(1,6-hexanediol—-treated versus untreated) for regions defined as SEs 
or typical enhancers (TEs) (methods and table S2). (E) Box-plot 
representation of the log2 fold change in RNA Pol II ChIP-seq density 
(1,6-hexanediol—-treated versus untreated) within the gene body 


to transcription end site) of all active genes (reads 
n> 1), TE-associated genes, or SE-associated 


genes. (F) Gene set enrichment analysis, with genes ranked by 


in RNA Pol II ChIP-seq density within the gene 


body and annotated against the set of SE-associated genes. Enrichment 
score profile and the position of SE-associated genes are shown. 


recovery for MEDI was reduced, and the extent of 
recovery for BRD4 was diminished (Fig. 2, E to 
H). These results indicate that puncta contain- 
ing BRD4 and MED1 have liquid-like properties 
in cells, consistent with previously described 
phase-separated condensates. 


Coactivator puncta and SE occupancy 
are sensitive to condensate perturbation 


To further investigate the biophysical properties 
of BRD4 and MED! puncta, we investigated their 
sensitivity to 1,6-hexanediol, a compound known 
to disrupt liquid-like condensates, possibly by 
disruption of hydrophobic interactions (40). We 
found that treatment of mESCs expressing endo- 
genously tagged mEGFP-BRD4 or mEGFP-MED1 
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Fig. 4. Intrinsically disordered regions (IDRs) of BRD4 and MED1 
phase-separate in vitro. (A) Graphs plotting intrinsic disorder 

for BRD4 and MEDI. PONDR (Predictor of Natural Disordered Regions) 
VSL2 scores are shown on the y axis, and amino acid positions are 
shown on the x axis. The purple bar designates the IDR under 
investigation. (B) Schematic of recombinant mEGFP fusion proteins 
used in this study. Purple boxes indicate the IDRs of BRD4 and MED1 
shown in (A). (C) Visualization of turbidity associated with droplet 
formation. Tubes containing BRD4-IDR (left pair), MED1-IDR (middle 


pair), or GFP (right pair) in the presence (+) or absence (—) of PEG-8000 


are shown. Blank tubes are included between pairs for contrast. 


(D) Representative images of droplet formation at different protein 
concentrations. BRD4-IDR, MED1-IDR, or mEGFP were added 

to the droplet formation buffer to the final concentrations indicated. 
(E) Representative images of droplet formation at different salt 
concentrations. BRD4-IDR or MED1-IDR was added to droplet formation 
buffer to achieve 10 uM protein concentration with a final NaCl 
concentration as indicated. (F) Representative images of the droplet 
reversibility experiment with BRD4-IDR (top row) or MED1-IDR (bottom 
row) [20 uM protein and 75 mM NaCl (initial), followed by a 1:1 dilution 
(172 dilution) or a 1:1 dilution with an increase to 425 mM NaCl 

(172 dilution + NaCl)]. 


with 1,6-hexanediol caused a reduction in the 
number of BRD4 and MED1 puncta (Fig. 3, A 
and B). 

To determine the effect of 1,6-hexanediol on 
BRD4, MEDI, and RNA polymerase II (RNA Pol IT) 
occupancy at enhancers and genes, ChIP-seq was 
performed with antibodies against these proteins 
in untreated or 1,6-hexanediol-treated mESCs. 
Treatment with 1,6-hexanediol caused a reduc- 
tion in all three proteins at enhancers, with the 
most profound effects occurring at SEs (Fig. 3, 
Cand D, and fig. S5A). For example, at the K/f4 
SE, the levels of BRD4 were reduced by 44%, 
those of MED1 by 80%, and those of RNA Pol II 
by 56% (Fig. 3C). Similar effects were observed 
genome-wide, where reductions in BRD4, MED1, 
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and RNA Pol II were substantially larger at SEs 
than at typical enhancers (Fig. 3D), and the 
degrees to which BRD4 and MEDI were lost 
from SEs were positively correlated (fig. S5B). 
These results are consistent with the notion that 
BRD4 and MED! form condensates at SEs that 
are sensitive to 1,6-hexanediol. 

The level of RNA Pol II occupancy across gene 
bodies can be used as a measure of transcriptional 
output (47). The ChIP-seq data revealed that the 
reduction in BRD4 and MEDI occupancy at SEs 
was associated with a loss of RNA Pol IT occupancy 
across SE-associated gene bodies (Fig. 3, C and E, 
and fig. S5A). When genes were ranked by the 
extent to which RNA Pol II was lost upon 1,6- 
hexanediol treatment, SE-associated genes were 


highly enriched among those that lost the 
most RNA Pol II (Fig. 3F). These results are 
consistent with the idea that BRD4 and MED1 
condensates are associated with SEs and that 
loss of condensate integrity adversely affects 
transcription. 


IDRs of BRD4 and MED1 phase-separate 
in vitro 

BRD4 and MEDI contain large IDRs (Fig. 4A) 
and share features with the IDRs of several pro- 
teins known to facilitate condensate formation 
(2, 3), including high proline and glutamine con- 
tent (BRD4), high serine content (MEDI), and 
acidic and basic regions (BRD4 and MED1). The 
purified IDRs of several proteins involved in 
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Fig. 5. The IDR of MED1 participates in phase 
separation in cells. (A) Schematic of the 
optolDR assay, depicting recombinant protein 
with an IDR (purple), mCherry (red), and Cry2 
(orange) expressed in cells exposed to blue 
light. (B and C) Images of NIH3T3 cells 
expressing either (B) mCherry-Cry2 or 

(C) a portion of the MED1-IDR (amino acids 
948 to 1157) fused to mCherry-Cry2 
(MED1-optolDR). Cells were subjected to 
laser excitation every 2 s for the indicated 
times. (D) Time-lapse images of the nucleus 
of an NIH3T3 cell expressing MED1-optoIDR 
subjected to laser excitation every 2 s for 

the times indicated. A droplet fusion event 
occurs in the region highlighted by the yellow 
box. (E) The droplet fusion event highlighted 
in (D) at higher resolution and extended 
times as indicated. (F) Image of a MED1-optolIDR 
optoDroplet (yellow box) before (left), during 
middle), and after (right) photobleaching. 
The blue box highlights an unbleached 

region for comparison. Time relative to photo- 
bleaching (O s) is indicated. (G) Signal 
intensity relative to the prebleaching signal 

(y axis) versus time relative to photobleaching 
xX axis). Data are shown as average relative 
intensity SD (n = 15). (H) Time-lapse and 
close-up view of droplet recovery for regions 
highlighted in (F). Times relative to photo- 
bleaching are indicated. Scale bars, 1 um. 


condensate formation form phase-separated 
droplets in vitro (7, 36, 37, 42), so we investigated 
whether the IDRs of BRD4 or MED1 form such 
droplets in vitro. Purified recombinant mEGFP- 
IDR fusion proteins (BRD4-IDR and MED1-IDR) 
(Fig. 4B) were added to buffers containing 10% 
PEG-8000 (polyethylene glycol, molecular weight 
8000; materials and methods), turning the so- 
lution opaque, whereas equivalent solutions with 
mEGFP alone remained clear (Fig. 4C). Fluores- 
cence microscopy of the opaque MED1-IDR and 
BRD4-IDR solutions revealed GFP-positive, micron- 
sized spherical droplets freely moving in solution 
(Movies 1 and 2) and falling onto and wetting 
the surface of the glass coverslip, where they re- 
mained stationary (Movie 3). As determined by 
aspect ratio analysis, the MED1-IDR and BRD4- 
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IDR droplets were highly spherical (fig. S6A), a 
property expected for liquid-like droplets (J-3). 

Phase-separated droplets typically scale in size 
according to the concentration of components in 
the system (43). We performed the droplet forma- 
tion assay with varying concentrations of BRD4- 
IDR, MED1-IDR, and mEGFP, ranging from 0.625 
to 20 pM. BRD4-IDR and MED1-IDR formed drop- 
lets with concentration-dependent size distributions, 
whereas MEGFP remained diffuse in all conditions 
tested (Fig. 4D and fig. S6B). Although these droplets 
were smaller at lower concentrations, we observed 
BRD4-IDR and MED1-IDR droplets at the lowest 
concentration tested (0.625 uM) (fig. S6C). 

To investigate the biophysical properties of 
these droplets, we tested their ability to form 
under varying salt concentrations (to probe the 


contribution of electrostatic interactions) or upon 
1,6-hexanediol treatment (to probe the contribu- 
tion of hydrophobic interactions). The size dis- 
tributions of both BRD4-IDR and MED1-IDR 
droplets shifted toward smaller droplets with in- 
creasing NaCl concentration (from 50 to 350 mM) 
(Fig. 4E and fig. S6D), and opacity was reduced 
with 10% 1,6-hexanediol treatment (fig. S’7A). 
These results demonstrate that a variety of mo- 
lecular interactions contribute to BRD4-IDR and 
MED1-IDR droplet formation. 

We next sought to test whether the droplets 
are irreversible aggregates or reversible phase- 
separated condensates. To do this, BRD4-IDR 
and MED1-IDR were allowed to form droplets in 
an initial solution. The protein concentration 
was then diluted by half in equimolar salt or in 
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a higher salt solution (Fig. 4F). The preformed 
BRD4-IDR and MEDI1-IDR droplets were reduced 
in size and number with dilution and even further 
reduced with elevated salt concentration (Fig. 4F 
and fig. S7B). These results show that the BRD4- 
IDR and MED1-IDR droplets form a distribution 
of sizes that is dependent on the conditions of the 
system and, once formed, respond to changes in 
the system, with rapid adjustments in size. These 
features are characteristic of phase-separated con- 
densates formed by networks of weak protein- 
protein interactions (7-3). 


MED1-IDR participates in liquid-liquid 
phase separation in cells 


To investigate whether the coactivator IDRs 
facilitate phase separation in cells, we used a pre- 
viously developed assay to manipulate local pro- 
tein concentrations within the cell; this opto[IDR 
assay tests IDR-dependent, light-inducible drop- 
let formation in vivo (44). Briefly, the photo- 
activatable, self-associating Cry2 protein was 
labeled with mCherry and fused to an IDR of 
interest. This fusion mediates a blue light- 
inducible increase in the local concentration 
of selected IDRs within the cell (Fig. 5A) (44). 
In this assay, IDRs known to promote phase 
separation enhance the photoresponsive cluster- 
ing properties of Cry2 (45, 46), causing rapid 
formation of liquid-like spherical droplets under 
stimulation by blue light. Fusion of a portion of 
the MED1-IDR to Cry2-mCherry facilitated the 
rapid formation of micron-sized spherical drop- 
lets upon blue light stimulation (optoDroplets) 
(Fig. 5, B and C, and fig. S8). During stimula- 
tion, proximal droplets were observed to fuse 
(Fig. 5, D and E, and Movie 4). The fusions ex- 
hibited characteristic liquid-like fusion properties 
of necking and relaxation to spherical shape (Fig. 
5E). The MED1-IDR droplets persisted after blue 
light stimulation and exhibited liquid-like FRAP 
recovery rates in the absence of blue light stim- 
ulation (Fig. 5, F to H). The rapid FRAP kinetics 
in the absence of light-activated Cry2 interac- 
tions suggests that the MED1-IDR optoDroplets 
established by blue light are dynamic assemblies 
exchanging with the dilute phase. 


Conserved serine bias in the MED1-IDR 
is necessary for phase separation 


Previous studies have implicated low-complexity 
IDRs of proteins in liquid-liquid phase separa- 
tion (7, 36, 37, 42). An examination of the amino 
acid content of MEDI revealed that the IDR con- 
tains a compositional bias for serine (Fig. 6A). 
This serine compositional bias is conserved among 
vertebrates (Fig. 6B). To investigate whether 
the serine bias is necessary for the MED1-IDR’s 
capacity to phase-separate, we mutated all the 
serine (S) residues to alanine (A) and investigated 
the ability of this mutated IDR to form phase- 
separated droplets in vitro. The MEDI-IDR S-to-A 
mutant was incapable of forming phase-separated 
droplets under conditions in which the wild-type 
IDR readily formed droplets (Fig. 6C), indicating 
that the conserved serine bias in the MED1-IDR is 
necessary for droplet formation. 
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MED1-IDR droplets can incorporate 
proteins necessary for transcription 

A proposed function of phase separation at SEs is 
the ability to compartmentalize and concentrate 
factors within a biomolecular condensate, so we 


Movie 1. BRD4-IDR droplets in solution. Each 
frame represents 1 s. The movie is rendered at 
12 frames per second. 20 uM protein, 125 mM 
NaCl. Scale bar, 5 um. 


Movie 2. MED1-IDR droplets in solution. 
Each frame represents 1 s. The movie is 
rendered at 12 frames per second. 20 uM 
protein, 125 mM NaCl. Scale bar, 5 um. 


Movie 3. MED1-IDR droplets settling onto a 
glass coverslip. Each frame represents 1 s. The 
movie is rendered at 12 frames per second. 

10 uM protein, 125 mM NaCl. Scale bar, 5 um. 


sought to test whether MED1-IDR droplets could 
recapitulate this compartmentalization function 
in vitro. We identified conditions under which the 
MED1-IDR could form droplets but the BRD4-IDR 
could not (fig. S9). We then investigated whether 
the MEDI1-IDR droplets could compartmentalize 
BRD4-IDR protein under these conditions (Fig. 7A). 
Using differentially labeled proteins (mCherry- 
MED1-IDR and mEGFP-BRD4-IDR), we found 
that the MED1-IDR droplets could incorporate, 
and thus concentrate, the BRD4-IDR protein 
(Fig. 7A). The MED1-IDR droplets did not in- 
corporate mEGFP (Fig. 7A). To probe the ap- 
proximate mesh size of the MED1-IDR droplets 
(47), we incubated them with fluorescently labeled 
dextrans with average molecular weights of 4, 10, 
and 40 kDa. We found that the 4-kDa dextrans 
were incorporated into the MED1-IDR droplets, 
the 10-kDa dextrans were incorporated with less 
efficiency, and the 40-kDa dextrans were excluded 
(fig. S10). These results suggest that the incor- 
poration of mEGFP-BRD4-IDR (105 kDa) into the 
MED1-IDR droplet is due to attractive molecular 
interactions, as opposed to passive diffusion 
through the droplet mesh. 

We next investigated whether the MED1-IDR, 
introduced into a transcription-competent nuclear 
extract, would form droplets that might incor- 
porate BRD4 or other transcriptional compo- 
nents. We found that the wild-type MED1-IDR, 
but not the MEDI-IDR S-to-A mutant, formed 
droplets in these extracts (Fig. 7B). The MED1-IDR 
phase-separated droplets were denser than the 
surrounding extract and thus could be purified 
from solution by centrifugation. Immunoblot 
analysis revealed that BRD4 and the largest sub- 
unit of RNA Pol II (RPB1) were enriched in pel- 
leted droplets in a MED1-IDR dose-dependent 
manner (Fig. 7C). These results indicate that the 
MED1-IDR droplets can incorporate BRD4 and 
RNA Pol II. 

The ability of the MED1-IDR protein to in- 
corporate BRD4 and RNA Pol II into an artificial 
phase-separated compartment suggests that it 
sequesters key components of the transcription 
apparatus and might thus “squelch” transcription 
in the nuclear extract. We carried out an in vitro 
transcription assay with these extracts and found 
that the wild-type MED1-IDR protein does squelch 
transcription, correlating with the amount of 
material separated from solution by the MED1- 
IDR droplets (Fig. 7D). We did not observe these 
effects with equivalent concentrations of mEGFP 
or with the MED1-IDR S-to-A mutant (Fig. 7D). 
These results demonstrate that the MED1-IDR 
has the capacity to compartmentalize and con- 
centrate transcriptional machinery from a com- 
plex nuclear extract. 


Discussion 


SEs regulate genes with prominent roles in healthy 
and diseased cellular states (14, 15, 19-25, 48, 49). 
SEs and their components have been proposed 
to form phase-separated condensates (30), but 
with no direct evidence. Here we demonstrate 
that two key components of SEs, BRD4 and MED1, 
form nuclear condensates at sites of SE-driven 
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Fig. 6. Conserved serine bias is necessary for MED1-IDR phase sepa- 
ration. (A) Amino acid composition of the MED1 protein. Each row 
represents information for a single amino acid. Single-letter amino code 
abbreviations (right) are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, 
Gly; H, His; |, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; 
T, Thr; V, Val; W, Trp; and Y, Tyr. The length of the row corresponds to the 
length of the MED1 protein. Black bars represent the occurrence of the 


transcription. Within these condensates, BRD4 
and MED1 exhibit apparent diffusion coefficients 
similar to those previously reported for other 
proteins in phase-separated condensates in vivo 
(36, 37). The IDRs of both BRD4 and MEDI are 
sufficient to form phase-separated droplets in vitro, 
and the MEDI-IDR facilitates phase separation 
in living cells. Droplets formed by MED1-IDR are 
capable of concentrating transcriptional machinery 
in a transcriptionally competent nuclear extract. 
These results support a model in which tran- 
scriptional coactivators form phase-separated 
condensates that compartmentalize and concen- 
trate the transcription apparatus at SE-regulated 
genes and identify SE components that likely 
play a role in phase separation. 

SEs are established by the binding of master 
TFs to enhancer clusters (1/4, 15). These TFs typ- 
ically consist of a structured DNA-binding domain 
and an intrinsically disordered transcriptional 
activation domain (50-52). The activation do- 
mains of these TFs recruit high densities of many 
transcription proteins, which, as a class, are 
enriched for IDRs (53). Although the exact 
client-scaffold relationship (54) between these 
components remains unknown, it is likely that 
these protein sequences mediate weak multi- 
valent interactions, thereby facilitating conden- 
sation. We propose that condensation of such 
high-valency factors at SEs creates a reaction 
crucible within the separated dense phase, where 
high local concentrations of the transcriptional 
machinery ensure robust gene expression. 

The nuclear organization of chromosomes is 
likely influenced by condensates at SEs. DNA 
interaction technologies indicate that the individual 
enhancers within the SEs have exceptionally high 
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and 10% Ficoll-400). 


interaction frequencies with one another (J6-18), 
consistent with the idea that condensates draw 
these elements into close proximity in the dense 
phase. Several recent studies suggest that SEs 
can interact with one another and may also con- 
tribute in this fashion to chromosome organization 
(55, 56). Cohesin, an SMC (structural maintenance 
of chromosomes) protein complex, has been im- 
plicated in constraining SE-SE interactions be- 
cause its loss causes extensive fusion of SEs within 


Movie 4. Formation of MED1-IDR optoDroplets 
upon stimulation with blue light. NIH3T3 cells 
expressing the MED1-optolDR construct were 
subjected to 488-nm laser light in 2-s intervals. 
Each frame represents 2 s. The movie is rendered 
at 12 frames per second. Scale bar, 5 um. 
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1pm 
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indicated amino acid at that position in MED1. The purple bar represents 
the IDR of MED1 under investigation. (B) Serine composition of MED1 
protein from indicated organisms, presented as in (A). (©) Mutating all 
serines to alanine (S to A) disrupts phase separation. Representative 
images of wild-type MED1-IDR or MED1-IDR S-to-A mutant fused to 
mEGFP in the droplet formation assay (10 uM protein, 125 mM NaCl, 


the nucleus (56). These SE-SE interactions may 
be due to a tendency of liquid-phase condensates 
to undergo fusion (7-3). 

The model whereby phase separation of co- 
activators compartmentalizes and concentrates 
the transcription apparatus at SEs and their 
regulated genes, described here and corroborated 
by (57), raises many questions. How does con- 
densation contribute to regulation of transcrip- 
tional output? A study of RNA Pol II clusters, 
which may be phase-separated condensates, 
suggests a positive correlation between conden- 
sate lifetime and transcriptional output (58). What 
components drive formation and dissolution of 
transcriptional condensates? Our studies indi- 
cate that BRD4 and MEDI likely participate, 
but the roles of DNA-binding TFs, RNA Pol II, 
and regulatory RNAs require further study. Why 
do some proteins, such as HPla, contribute to 
phase-separated heterochromatin condensates 
(59, 60) and others contribute to euchromatic 
condensates? The rules that govern partitioning 
into specific types of condensates have begun to 
be studied (67-65) and will need to be defined for 
proteins involved in transcriptional condensates. 
Does condensate misregulation contribute to 
pathological processes in disease, and will new 
insights into condensate behaviors present new 
opportunities for therapy? Mutations within 
IDRs and misregulation of phase separation 
have already been implicated in a number of 
neurodegenerative diseases (66-68). Tumor 
cells have exceptionally large SEs at driver on- 
cogenes that are not found in their cell of origin, 
and some of these are exceptionally sensitive 
to drugs that target SE components (22-25). 
How do we take advantage of phase separation 
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Fig. 7. MED1-IDR 
droplets compart- 
mentalize and 
concentrate proteins 
necessary for 
transcription. 

(A) MED1-IDR 
droplets incorporate 
BRD4-IDR protein in 
vitro. The indicated 
mEGFP or mCherry 
fusion proteins were 
mixed at 10 uM each 
in buffer D containing 
10% Ficoll-400 and 
125 mM NaCl. 
Indicated fluores- 
cence channels are 
presented for each 
mixture. Illustrations 
summarizing results 
are shown on the 
right. (B) MED1-IDR 
forms droplets in 

an in vitro transcription 
reaction containing 
HeLa cell nuclear 
extract, whereas the 
MEDL-IDR S-to-A 
mutant does not. 
Shown are represent- 
ative images of the 
indicated mMEGFP- 
fusion protein when 
added to an in vitro 
transcription reaction 
containing HeLa cell 
nuclear extract at a 


A 


Illumination: mEGFP 


Pellets of droplets from 
HeLa cell nuclear extract 


MED1-IDR(uM) - 2 4 10 - - 
S-to-Amutant(uM) - - - - 10 - 
mEGFP (uM) - - - - - 10 


123 4 5 6 


mCherry 


DR 


mEGFP-BRD4-| 


mCherry-MED1-IDR mCherry-MED1-IDR 
mEGFP 


Immunoblot 


D in vitro transcription 


MED1-IDR (uM) - 2 
S-to-A mutant(uM) - - - - 


410 -- -- - 
24 10- - 


° mEGFP B 
e mCherry 

— ~ MED1-IDR 

—~ BRD4-IDR 


HeLa cell nuclear extract 


merge (3 mg/ml) 


mEGFP-MED1-IDR 


S-to-A mutant 


Noe 
° 
mEGFP-MED1-IDR 


Droplet formation in nuclear extract 


nuclear extract 
+MED1-IDR-mutant 


nuclear extract 
+MED1-IDR 


nuclear extract 


Droplet formation and in vitro transcription in nuclear extract 


nuclear extract 
+MED1-IDR-mutant 


nuclear extract 
+MED1-IDR 


nuclear extract 


final concentration of 
3 mg/ml (a complete 
list of components is 
given in the methods). 
(C) MED1-IDR droplets 
compartmentalize 
transcriptional machin- 
ery from a nuclear 
extract. Shown are 


pA, MLA53 
(RNA) 


1 


immunoblots of the pellet fraction of the indicated protein added to in 
vitro transcription reactions [as described in (B)]. A proposed model of 
molecular interactions taking place within MED1-IDR droplets in the 
nuclear extract is illustrated on the right. (D) MED1-IDR droplets 
compartmentalize machinery necessary for the in vitro transcription 
reaction. An autoradiograph of radiolabeled RNA products of in vitro 


principles established in physics and chemistry 
to more effectively improve our understanding 
of this form of regulatory biology? Addressing 
these questions at the crossroads of physics, chem- 
istry, and biology will require collaboration across 
these diverse sciences. 


Methods summary 


Immunofluorescence against BRD4 and MEDI, 
coupled with DNA-FISH or RNA-FISH against 
SEs or SE-driven nascent transcripts, was per- 
formed in mESCs to visualize the colocaliza- 
tion between BRD4 or MED1 puncta and SEs. 
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mEGFP (uM) _ . 


23 4 5 6 7 8 9 10 


BRD4 and MEDI were endogenously tagged with 
mEGFP in mESCs to visualize the organization 
of BRD4 and MEDI and to study their dynamics 
by FRAP and drug treatments in live cells. ChIP- 
seq was performed to investigate the effect of 1,6- 
hexanediol treatment on the chromatin occupancy 
of BRD4, MED1, and RNA Pol II. Recombinant 
BRD4-IDR and MED1-IDR were purified to test 
their capacity to phase-separate in vitro. The 
optoIDR assay (45) was implemented to test the 
capacity of a section of MEDI-IDR to phase- 
separate in live cells. Mutations were introduced 
into MED1-IDR to study the sequence determi- 


27 July 2018 


transcription reactions under indicated conditions is shown on the 
left. The arrow indicates the expected RNA product. Reactions 
were conducted as in (68) with minor modifications (full details are 
given in the methods). 
taking place within MED1-IDR droplets in nuclear extract and the impact 
on the in vitro transcription reaction is illustrated on the right. 


A proposed model of molecular interactions 


nants of MED1-IDR phase separation. BRD4-IDR 
and MED1-IDR fused to different fluorescent 
tags were used to demonstrate the capacity of 
MED1-IDR droplets to compartmentalize and 
concentrate BRD4-IDR. Formation of MED1-IDR 
droplets in a transcriptionally competent nuclear 
extract was used to study the ability of MED1-IDR 
droplets to compartmentalize and concentrate 
BRD4 and RNA Pol II from a complex extract. 
In vitro transcription assays were used to mea- 
sure the effect of synthetic droplet formation 
on transcription. All procedures are described 
in detail in the supplementary materials. 
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Three-dimensional intact-tissue 
sequencing of single-cell 
transcriptional states 
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Jia Liu, Garry P. Nolan}, Felice-Alessio Bava}, Karl Deisserotht 


INTRODUCTION: Single-cell RNA sequencing 
has demonstrated that both stable cell types 
and transient cell states can be discovered and 
defined by transcriptomes. In situ transcrip- 
tomic methods can map both RNA quantity 
and position; however, it remains challenging 
to simultaneously satisfy key technological re- 
quirements such as efficiency, signal intensity, 
accuracy, scalability to large gene numbers, and 
applicability to three-dimensional (3D) volumes. 
Well-established single-molecule fluorescence 
in situ hybridization (FISH) approaches (such 
as MERFISH and seqFISH) have high detection 
efficiency but require long RNA species (more 
than 1000 nucelotides) and yield lower intensity 
than that of enzymatic amplification methods 
(tens versus thousands of fluorophores per RNA 


molecule). Other pioneering in situ sequencing 
methods (via padlock probes and fluorescent 
in situ sequencing) use enzymatic amplification, 
thus achieving high intensity but with room 
to improve on efficiency. 


RATIONALE: We have developed, validated, 
and applied STARmap (spatially-resolved tran- 
script amplicon readout mapping). STARmap 
begins with labeling of cellular RNAs by pairs 
of DNA probes followed by enzymatic amplifica- 
tion so as to produce a DNA nanoball (amplicon), 
which eliminates background caused by mis- 
labeling of single probes. Tissue can then be 
transformed into a 3D hydrogel DNA chip by 
anchoring DNA amplicons via an in situ- 


synthesized polymer network and removing 
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STARmap for 3D transcriptome imaging and molecular cell typing. STARmap is an in situ 
RNA-sequencing technology that transforms intact tissue into a 3D hydrogel-tissue hybrid and 
measures spatially resolved single-cell transcriptomes in situ. Error- and background-reduction 
mechanisms are implemented at multiple layers, enabling precise RNA quantification, spatially 
resolved cell typing, scalability to large gene numbers, and 3D mapping of tissue architecture. 
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proteins and lipids. This form of hydrogel- 
tissue chemistry replots amplicons onto an op- 
tically transparent hydrogel coordinate system; 
then, to identify and quantify RNA species- 
abundance manifested by DNA amplicons, the 
identity of each species is encoded as a five-base 
barcode and read out by means of an in situ 
sequencing method that decodes DNA sequence 
in multicolor fluorescence. Using a new two-base 
sequencing scheme (SEDAL), STARmap was 
found to simultaneously detect more than 1000 
genes over six imaging cycles, in which sequenc- 
ing errors in any cycle cause misdecoding and 
are effectively rejected. 


RESULTS: We began by (i) detecting and quan- 
tifying a focused 160-gene set (including cell 
type markers and activity-regulated genes) simul- 
taneously in mouse primary visual cortex; (ii) 


clustering resulting per-cell 
gene expression patterns 
into a dozen distinct inhib- 


Read the full article 
at http://dx.doi. 
org/10.1126/ 
science.aat5691 


itory, excitatory, and non- 
neuronal cell types; and 
(iii) mapping the spatial 
distribution of all of these 
cell types across layers of cortex. For validation, 
per-cell-type gene expression was found to cor- 
relate well both with in situ hybridization re- 
sults and with single-cell RNA sequencing, and 
widespread up-regulation of activity-regulated 
genes was observed in response to visual stim- 
ulation. We next applied STARmap to a higher 
cognitive area (the medial prefrontal cortex) and 
discovered a more complex distribution of cell 
types. Last, we extended STARmap to much larger 
numbers of genes and spatial scales; we mea- 
sured 1020 genes simultaneously in sections— 
obtaining results concordant with the 160-gene 
set—and measured 28 genes across millimeter- 
scale volumes encompassing ~30,000 cells, re- 
vealing 3D patterning principles that jointly 
characterize a broad and diverse spectrum of 
cell types. 


CONCLUSION: STARmap combines hydrogel- 
tissue chemistry and in situ DNA sequencing 
to achieve intact-tissue single-cell measurement 
of expression of more than a thousand genes. 
In the future, combining this intact-system gene 
expression measurement with complementary 
cellular-resolution methodologies (with which 
STARmap is designed to be compatible)— 
including in vivo activity recording, optogenetic 
causal tests, and anatomical connectivity in the 
same cells—will help bridge molecular, cellular, 
and circuit scales of neuroscience. 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 
{Corresponding author. Email: deissero@stanford.edu (K.D.); 
alessio.bava@gmail.com (F.-A.B.); gnolan@stanford.edu (G.P.N.) 
Cite this article as X. Wang et al., Science 361, eaat5691 (2018). 
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Retrieving high-content gene-expression information while retaining three-dimensional (3D) 
positional anatomy at cellular resolution has been difficult, limiting integrative understanding of 
structure and function in complex biological tissues. We developed and applied a technology 
for 3D intact-tissue RNA sequencing, termed STARmap (spatially-resolved transcript amplicon 
readout mapping), which integrates hydrogel-tissue chemistry, targeted signal amplification, 
and in situ sequencing. The capabilities of STARmap were tested by mapping 160 to 1020 genes 
simultaneously in sections of mouse brain at single-cell resolution with high efficiency, 
accuracy, and reproducibility. Moving to thick tissue blocks, we observed a molecularly defined 
gradient distribution of excitatory-neuron subtypes across cubic millimeter—scale volumes 
(>30,000 cells) and a short-range 3D self-clustering in many inhibitory-neuron subtypes that 
could be identified and described with 3D STARmap. 


n biological tissues, diversity of function 

arises from diversity of form—in part, via the 

complexity of cell-specific gene expression, 

which defines the distinct three-dimensional 

(3D) molecular anatomy and cellular proper- 
ties of each tissue. In situ transcriptomic tools 
for the spatial mapping of gene expression with 
subcellular resolution have emerged that may 
be applicable to probing these tissue structure- 
function relationships, including both multiplexed 
in situ RNA hybridization and in situ RNA se- 
quencing (7-10). Current in situ sequencing ap- 
proaches face the challenge of implementing 
enzymatic reactions in the dense, complex tissue 
environment and currently suffer from low efficien- 
cy (2), but the potential value of such intact-tissue 
sequencing could be enormous; in comparison 
with hybridization-based multiplexing/readout, 
which uses multiple polynucleotide probes to 
encode gene identity (3-5), sequencing operates 
with single-nucleotide resolution and thus inher- 
ently provides greater information. In addition, 
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in situ sequencing methods typically use signal 
amplification, which is important for the detec- 
tion of short transcripts (such as neuropeptides) 
and for high-quality imaging in thick tissue 
blocks. However, current sequencing methods 
have not yet been successfully applied to 3D 
volumes of intact tissue because of fundamental 
limitations in requisite sensitivity, fidelity, and 
scalability for throughput in tissues such as the 
mammalian brain. 

Hydrogels have been widely used for extra- 
cellular 3D scaffolding in applications across 
biology and medicine (77-13). Recently developed 
hydrogel-tissue chemistry (HTC) methodologies 
(14), beginning with CLARITY (15), physically 
link in situ-synthesized polymers with selected 
intracellular biomolecules. This process transforms 
the tissue, from within its constituent cells, into 
a new state suitable for high-resolution volu- 
metric imaging and analysis compatible with 
many kinds of molecular phenotyping for pro- 
teins, nucleic acids, and other targets (15). HTC- 
based hydrogel-embedding strategies have been 
extended to nucleic acid analyses in the form 
of in situ hybridization for RNA (J6-19), but 
these have not yet been extended to in situ RNA 
sequencing—which would have the potential to 
reveal the full molecular complexity of the tran- 
scriptome. In nontissue environments, however, 
purely synthetic hydrogels have been used to 
accommodate enzymatic reactions that include 
DNA sequencing (20), and if biological tissue 
could be converted into a hydrogel-embedded 
form compatible with creation, retention, and 
functional presentation of RNA-derived or hybri- 
dized complementary DNA (cDNA), it might be 


possible to perform 3D in situ sequencing with- 
in such a tissue-hydrogel formulation—leveraging 
the crucial attendant properties of optical trans- 
parency, reduced background, elevated diffusion 
rate, and greater mechanical stability. We achieved 
this goal with the development and application 
of a sequencing-based method (spatially-resolved 
transcript amplicon readout mapping, or STARmap) 
for targeted 3D in situ transcriptomics in intact 
tissue (Fig. 1A); using STARmap, we were able 
to identify organizational principles of a full spec- 
trum of cell types, which would not have been 
otherwise accessible for identification in the adult 
mammalian brain. 


Results 
Design and validation of 
STARmap principles 


One component is an efficient approach for in 
situ amplification of a library of cDNA probes 
hybridized with cellular RNAs (this approach 
is termed SNAIL, for specific amplification of 
nucleic acids via intramolecular ligation). Reverse 
transcription may be the major efficiency-limiting 
step for in situ sequencing (7, 27), and SNAIL 
bypasses this step with a pair of primer and 
padlock probes (fig. S1A) designed so that only 
when both probes hybridize to the same RNA 
molecule can the padlock probe be circularized 
and rolling-circle-amplified to generate a DNA 
nanoball (amplicon) that contains multiple copies 
of the cDNA probes (Fig. 1, A to D). This mech- 
anism ensures target-specific signal amplification 
and excludes noise that invariably otherwise 
arises from nonspecific hybridization of single 
probes. The outcome includes much higher ab- 
solute intensity and signal-to-noise ratio (SNR) 
as compared with those of commercial single- 
molecule fluorescent in situ hybridization (smFISH) 
probes (fig. S1, B to F) and substantial improve- 
ment of detection efficiency (comparable with that 
of single-cell RNA sequencing), with simplified 
experimental procedures compared with previous 
in situ RNA sequencing methods (fig. $1, G to 1. 

To enable cDNA amplicon embedding in the 
tissue-hydrogel setting, amine-modified nucleo- 
tides were spiked into the rolling-circle amplifi- 
cation reaction, functionalized with an acrylamide 
moiety by using acrylic acid N-hydroxysuccinimide 
esters, and copolymerized with acrylamide mono- 
mers so as to form a distinct kind of hydrogel- 
DNA amplicon network (Fig. 1A and fig. S2A). 
The resulting tissue-hydrogel was then subjected 
to protein digestion and lipid removal in order to 
enhance transparency (fig. S2, B to E). This de- 
sign chemistry dictates that amplicons are co- 
valently linked with the hydrogel network, and 
such cross-linking is essential to maintain the 
position and integrity of the amplicons through 
many cycles of detection (fig. S2, F to H). 

A five-base barcode (library size of 1024) was 
designed and built into each padlock probe as a 
gene-unique identifier to be sequenced, thus 
enabling multiplexed gene detection (Fig. 1A). 
Sequencing-by-synthesis paradigms were avoided 
because these require elevated reaction tem- 
peratures, which in turn are problematic for 
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Fig. 1. STARmap principles: in situ RNA sequencing for spatial tran- 
scriptomics within the 3D tissue environment. (A) STARmap overview 
schematic. After brain tissue is prepared (mouse brain protocols are 
available in the supplementary materials, materials and methods), the 
custom SNAIL probes that encounter and hybridize to intracellular mRNAs 
(dashed lines) within the intact tissue are enzymatically replicated as 
cDNA amplicons. The amplicons are constructed in situ with an acrylic 
acid N-hydroxysuccinimide moiety modification (blue) and then copolym- 
erized with acrylamide to embed within a hydrogel network (blue wavy 
lines), followed by clearance of unbound lipids and proteins (fig. S2). Each 
SNAIL probe contains a gene-specific identifier segment (red) that is 
read-out through in situ sequencing with two-base encoding for error 
correction (SEDAL) (fig. S3). Last, highly multiplexed RNA quantification in 
three dimensions reveals gene expression and cell types in space. 

(B) SNAIL logic. A pair of primer and padlock probes amplifies target- 
specific signals and excludes noise known to commonly arise from 
nonspecific hybridization of a single probe. (C and D) Only adjacent binding 
of primer and padlock probes leads to signal amplification. MRNA A 
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represents Gapdh, and mRNA B represents Actb. Both fluorescent images 
show Gapdh (gray) MRNA and cell nuclei (blue) labeling in mouse brain 
slice; there is an absence of labeling with mismatched primer and padlock 
(right). Scale bar, 10 um. (E) In situ sequencing of DNA amplicons in the 
tissue-hydrogel complex via SEDAL, the sequencing-by-ligation method 
devised for STARmap. For each cycle, the reading probes (gray line without 
star-symbol label) contain an incrementally increasing-length run of 
degenerate bases (N representing an equal mixture of A, T, C, and G) with 
phosphate at the 5’ end (5’P) to set the reading position; the decoding 
probes (gray line with star-symbol label) are labeled by fluorophores with 
color coding for the dinucleotide at the 3’ end. Only if both probes are 
perfectly complementary to the DNA template (black lower sequence) can 
the two kinds of probes then be ligated to form a stable product with a 
high melting temperature, allowing later imaging after unligated probes are 
washed away. After each imaging cycle, probes are stripped away from 
the robust tissue-hydrogel by using 60% formamide so that the next cycle 
can begin. X, unknown base to be read; red underline, decoded sequence; 
Chl to Ché4, fluorescence channels. Scale bar, 2 um. 
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Fig. 2. STARmapping cell A 
types in V1. (A) Experimental 
design. Mice were dark housed, 
before sacrifice, for 4 days 

and then either kept in the dark 

or exposed to light for 1 hour. 

V1 was coronally sectioned, and 
RNAs of 112 cell type markers ne 
and 48 activity-regulated genes 


V1 cortex 
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across all six cycles (bottom). ne = 300 
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11.78 by 11.78 um, scale bar, 0 
2 um; Channel, color code for 

the four fluorescence channels; D 
L1 to L6, the six neocortical 
ayers; Cc, corpus callosum; 
HPC, hippocampus. (C) Histo- 
grams. Shown are detected 
reads (DNA amplicons) per cell 
left), and genes per cell 

right). (D) Quantitative 
reproducibility of biological 
replicates, whether in the 

ight or dark condition: logs 
amplicon quantity) for E 
160 genes across the whole 
imaging region plotted. Repl, 
expression value in first 
replicate; rep2, expression 
value in second replicate. 

(E) Validation of STARmap. 
(Left) in situ images from Allen 
Institute of Brain Science 
(AIBS). (Right) RNA pattern 

of individual genes extracted 
from 160-gene STARmap, 
which reliably reproduced the 
spatial gene expression pattern 
from AIBS. (F) Uniform 
manifold approximation plot 
(UMAP), a nonlinear dimen- 
sionality reduction technique 
used to visualize the similarity 
of cell transcriptomes in two 
dimensions, showing consistent 
clustering of major cell types 
across 3142 cells pooled from 
four biological replicates: 

2199 excitatory neurons, 

324 inhibitory neurons, and 
619 non-neuronal cells. 

(G) Gene expression heatmap 
for 112 cell-type markers 
aligned with each cell cluster, showing clustering by inhibitory, excitatory, or 
non-neuronal cell types. Expression for each gene is z-scored across all genes 
in each cell. (H) Representative cell-resolved spatial map in neocortex and 
beyond. Cell types are color-coded as in (F). (I to N) Clustering of excitatory 
and inhibitory subtypes. [(1) and (L)] UMAP plots, [(J) and (M)] bar plots of 
representative genes (mean + 95% confidence interval expression across all 
cells in that cluster, with each bar scaled to the maximum mean expression 
across all clusters), and [(K) and (N)] in situ spatial distribution of [(I) to (K)] 
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excitatory and [(L) to (N)] inhibitory neurons. The number of cells in each 
cluster was as follows: eL2/3, 589; eL4, 649; eL5, 393; eL6, 368; PV neurons, 
111; VIP neurons, 46; SST neurons, 46; and NPY neurons, 56. Inclusion of 
cells in clusters was guided entirely by amplicon representation in each 

cell without using spatial information; excitatory cell clusters were then 
named according to the spatial layering observed for that cluster, whereas 
inhibitory cell clusters were named according to the dominant cell-type 
amplicon based on the strong segregation of amplicon markers. 
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Fig. 3. STARmapping behavioral experience: Detecting and quantifying cell type-specific 
regulation of ARGs. (A) Validation. Shown is spatial expression pattern in the visual cortex of 
prototypical ARGs known as immediate early genes. Sacrifice was in darkness or after 1 hour of light 
exposure. (B and C) Volcano plots of log fold-change in gene expression between light and dark 
conditions in inhibitory and excitatory cell types. Genes with significantly increased or decreased 
expression (false discovery rate—adjusted P < 0.05, Wilcoxon rank-sum test) are labeled in green, 
and the most significantly changed genes (P < 0.05 and fold change > 2) are labeled in red. Many 
ARGs showed cell-type specificity, pointing to discovery of unanticipated cell type-specific logic 

of excitation-transcription coupling. (D) Violin plot of Egr2 expression by cell type. ****P < 0.0001, 


n.s. not significant, Wilcoxon rank-sum test; red- 


high-resolution imaging and sample stability (16) 
in comparison with sequencing-by-ligation methods 
that can be implemented at room temperature. 
However, none of the reported or commercially 
available sequencing-by-ligation methods exhibit 
the necessary SNR or accuracy for this challeng- 
ing intact-tissue application: Supported Oligo 
Ligation Detection (SOLID) sequencing causes 
strong background fluorescence in biological 
samples (10), whereas combinatorial probe 
anchor ligation (cPAL) sequencing (22) lacks 
an error-rejection mechanism (fig. $3). For 
this reason, an approach we term sequencing 
with error-reduction by dynamic annealing and 
ligation (SEDAL) was devised specifically for 
STARmap (fig. S3). 

SEDAL uses two kinds of short, degenerate 
probes: reading probes to decode bases, and fluo- 
rescence probes to transduce decoded sequence 
information into fluorescence signals. The two 
short probes only transiently bind to the target 
DNA and ligate to form a stable product for 
imaging only when a perfect match occurs; after 
each cycle corresponding to a base readout, the 
fluorescent products are stripped with form- 
amide, which eliminates error accumulation 
as sequencing proceeds (Fig. 1E and fig. S3B). 
In contrast to SOLID, SEDAL exhibits minimal 
background (fig. $3, C to F). A two-base encod- 
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ing scheme was designed and implemented in 
order to mitigate any residual errors related to 
imaging high densities of spots (fig. $3, G and 
H). On the basis of a panel of four very highly 
expressed test genes in mouse brain (to mimic 
amplicon crowdedness as would be encountered 
in highly multiplexed gene-detection), we found 
that the error rate of STARmap was more than 
an order of magnitude lower than prior cPAL 
methods (~1.8 versus 29.4%) (fig. S3, I to L) (17). 


Spatial cell typing in primary visual 
cortex with 160-gene STARmapping 


To test whether STARmap could deliver on the 
initial goal of high-content 3D intact-tissue se- 
quencing of single-cell transcriptional states with 
the necessary sensitivity and accuracy, we ap- 
plied STARmap to a pressing challenge in neuro- 
science: detecting and classifying cell types and 
corresponding tissue-organization principles 
in the neocortex of the adult mouse brain. The 
anatomy and function of the mouse primary 
visual neocortex have been extensively studied 
(23), a setting which here allows validation of 
our results by comparison with prior findings 
that span multiple papers, methodologies, and 
data sources (but the full diversity of deeply 
molecularly defined cell types within the visual 
cortex has not yet been spatially resolved in a 
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single experiment, precluding identification of 
potentially fundamental joint statistics and orga- 
nizational principles across 3D volumes). Among 
many examples of the experimental leverage such 
information could provide, joint 3D cell-typology 
mapping might be used to help decode the 
spatiotemporal logic of neural activity-triggered 
gene expression as a function of cell type and 
spatial location. 

We therefore used five-base barcoded SNAIL 
probes over six rounds of in situ SEDAL sequenc- 
ing in coronal mouse brain slices (Figs. 1A and 
2, A and B) to survey a large but focused and 
curated gene set [160 genes including 112 pu- 
tative cell-type markers collated from mouse 
cortical single-cell RNA sequencing (24, 25) 
and 48 activity-regulated genes (ARGs) (26, 27)]. 
In one arm of the experiment, visually evoked 
neural activity was provided to a cohort of mice 
via 1 hour of light exposure after 4 days of hous- 
ing in the dark; other mice were kept con- 
tinuously in the dark (27, 28). Eight-um-thick 
volumes containing up to 1000 cells covering 
all cortical layers were imaged. After six rounds 
of sequencing, fluorescent Nissl staining was 
used to segment cell bodies, allowing attribution 
of amplicons to individual cells (fig. S4, A and B). 
The values corresponding to amplicons-per-cell 
and genes-per-cell varied substantially (Fig. 2C), 
whereas the 160-gene expression pattern was 
consistent between biological replicates [corre- 
lation coefficient (7) = 0.94 to 0.95] (Fig. 2D), 
revealing reliable detection of transcript diversity 
at the single-cell level. Because only 160 genes 
were encoded out of the 1024 possible barcodes 
from five bases, we were able to quantify se- 
quencing errors that resulted in sequences being 
corrupted from the 160 true barcodes to the 
864 invalid barcodes, which was remarkably 
low at 1 to 4%. We found that this 160-gene 
pilot faithfully reproduced the spatial distri- 
bution of known cortical layer markers and in- 
terneurons, illustrated here via comparison of 
in situ images from paired public atlases (29) 
and STARmap results (Fig. 2E). 

We next performed cell classification using 
expression data of the 112 cell-type markers. 
First, >3000 cells pooled from four biological 
replicates were clustered into three major cell 
types (excitatory neurons, inhibitory neurons, 
and non-neuronal cells) by using graph-based 
clustering after principal-component decompo- 
sition (30) and then further subclustered under 
each category (Fig. 2, F to H, and fig. S4C). The 
richly defined excitatory neurons segregated into 
four major types (here denoted eL2/3, eL4, eL5, 
and eL6) by spatial correspondence with anatomic 
cortical layers and expression profiles of known 
layer-specific gene markers (Fig. 2, I to K, and 
fig. S5, A and B). Although spatial organiza- 
tion of the four excitatory types exhibited a 
layered pattern, there was extensive intermix- 
ing among different cell types within each layer. 
Inhibitory neurons were also clustered into four 
major types, here denoted by the dominant 
interneuron marker of each subtype [VIP, SST, 
NPY, and PV (Pvalb)] (Fig. 2, L to N, and fig. S5, 
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Fig. 4. STARmapping cell types and neural activity in mPFC. (A) Diagram of targeted region 

(red box) containing primarily prelimbic cortex (PrL) within mPFC. (B) UMAP visualization of all 
inhibitory (VIP, Reln, SST, Lhx6, and NPY), excitatory (eL2/3, eL5-1, eL5-2, eL5-3, eL6-1, and eL6-2), 
and non-neuronal (Astro, Oligo, Smc, and Endo) cell types. (C) Spatial visualization of cell type layout 
in mPFC, using the same color scheme as in (B). (D) Barplot of representative genes per cluster (mean + 
95% confidence interval), with each bar scaled to the maximum mean expression for that gene 
across clusters. (E) Piecharts showing the relative proportion of each major and minor cell type in both 
mPFC and visual cortex. (F) Violin plots of Fos gene induction in different excitatory cell types in mPFC 
in response to cocaine. The mice were sacrificed after 1 hour of cocaine or saline injection. Expr, 
normalized expression; n.s., not significant; *P < 0.05, ***P < 0.001, ****P < 0.0001, likelihood ratio 
test. Astro, astrocytes; Oligo, oligodendrocytes; Smc, smooth muscle cells; Endo, endothelial cells. 


C and D); the VIP and NPY type were observed 
to distribute more to the upper layers (L1 to 
L3), whereas SST and PV types were found 
more commonly in the lower layers (L4 to 
L6). We also detected non-neuronal cell types, 
including astrocytes, oligodendrocytes, endothe- 
lial cells, and smooth muscle cells (fig. S6). The 
number of major cell types illustrated here (12 
in total) can be further broken down (single-cell 
RNA sequencing can lead to classification into 
40 or more subtypes, which is consistent with 
the readily apparent heterogeneity of gene 
expression within each type) (figs. S5 and S6). 
With our targeted 112-gene set and at the size of 
600 to 800 cells per sample, all 12 major cell 
types could be reliably detected without batch 
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effects with highly similar spatial patterning 
among four biological replicates (defined as sam- 
ples prepared from different animals) (fig. S7) 
and matched with published single-cell RNA se- 
quencing results (fig. S8). 

We next sought to take advantage of STARmap’s 
quantitative capabilities at the single-cell level, 
in order to test differential gene expression analy- 
ses across experimental conditions, in molec- 
ularly defined cell types. To this end, we assessed 
visual stimulus-dependent gene expression 
patterns (via 48 defined ARGs with single-cell 
resolution in situ). Further developing the single- 
cell RNA sequencing procedure, mouse brains 
were flash-frozen with minimal handling time 
after sacrifice (<5 min), for maximal preservation 


of native transcriptional signatures. Consistent 
with prior reports (26-28), we observed global 
induction of known immediate-early genes (Fos, 
Egri, and Egr2) (Fig. 3A) in the primary visual 
cortex (V1) upon 1 hour of light exposure. At 
single-cell resolution, the quantitative extent (fold 
change in expression) of ARG changes exhibited 
striking diversity across neuronal cell types (Fig. 3, 
B and C, and fig. S9) (28). In general, ARG ex- 
pression programs in excitatory neurons across 
different layers were highly similar, whereas 
ARG expression programs in inhibitory cells ex- 
hibited much more distinct cell type-specific 
characteristics (fig. S9C); for example, Egr2 ex- 
hibited light-induction across excitatory neurons 
(Fig. 3D) but not in inhibitory neurons, whereas 
Prok2 was up-regulated in Vip inhibitory neurons 
(Fig. 3C) (22). Last, because neural activity can 
trigger cotranscription of noncoding RNAs from 
within enhancers of ARGs (26, 37), we also studied 
exemplars of these enhancer RNAs (eRNAs) (here, 
eRNAs 1 to 5 of the Fos gene); these transcripts, 
not polyadenylated, would be very difficult to 
measure with current single-cell RNA sequenc- 
ing. eRNA3 was identified as the most notable 
and consistent ARG marker (fig. S9B). 


Comparing spatial cell-type distributions 
in frontal and sensory cortices 


We then investigated to what extent the cell types 
of the higher cognitive cortex resemble those of 
the sensory cortex, as exemplified by V1. We 
applied the same 160-gene set to STARmapping 
the cell populations of the medial prefrontal cor- 
tex (mPFC) (Fig. 4A), which is involved in high- 
level cognitive functions such as attention and 
memory and is thought to be dysregulated in 
major psychiatric disorders (32). We identified 
15 distinct molecular cell types, including six 
excitatory neuron subtypes (eL2/3, eL5-1, eL5-2, 
eL5-3, eL6-1, and eL6-2, annotated by anatomic 
cortical layers), five inhibitory neuron subtypes 
(VIP, Reln, SST, Lhx6 and NPY, annotated by 
dominant gene markers), and four non-neuronal 
types (astrocytes, oligodendrocytes, endothe- 
lial cells, and smooth-muscle cells) (Fig. 4B and 
fig. S10). 

The spatial organization of broad cell types 
in mPFC resembled that of V1 with intermixed 
excitatory neuronal layers and sparsely distri- 
buted inhibitory neurons (Fig. 4C); however, the 
nature and composition of neuronal subtypes in 
mPFC and VI strikingly differed (Fig. 4, D and 
E). For excitatory subtypes, mPFC lacks eL4: (which 
is consistent with previous reports) (33) and 
exhibits reduced eL2/3 and vast expansion of 
eL5 and eL6 compared with that of V1 (Fig. 4E). 
Many new types of cell were discovered, includ- 
ing three eL5 subtypes and two eL6 subtypes, as 
characterized by gene markers Sema3e, Plexd2, 
Tpbg, Syt6, and Ctgf, respectively (Fig. 4D). 

Substantially different tissue organization by 
cell type was also observed for inhibitory sub- 
clusters. Sst-, Vip-, and Npy-positive subtypes 
in mPFC were represented similarly among all 
inhibitory neurons compared with those in V1, 
whereas Pvalb-positive cells were comparatively 
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Fig. 5. Simultaneous mapping of 1020 genes in V1 by STARmap. (A) Input fluorescence data. 
(Left) Maximum-intensity projection of the first sequencing round for 1020 gene experiment, showing 
all four channels simultaneously. Yellow square, zoom region. Scale bar, 100 um. (Right) Zoom into a 
single cell showing spatial arrangement of amplicons in three dimensions across six sequencing 
rounds. (B) Joint UMAP plot showing all excitatory (HPC, eL2/3, eL4, eL5, eL6-1, and eL6-2), non- 
neuronal (Sme, Other, Olig, Micro, Endo, and Astro), and inhibitory (PVALB, SST, VIP, and NPY) cell 
types. (C) Plot of all differentially expressed genes across every cluster, with P < 10°! and log fold 
change > 1.5. (D) Spatial map of all excitatory, non-neuronal, and inhibitory cell types in visual cortex 
using the same color code of (B). HPC, hippocampus; Smc, smooth muscle cells; Other, other 
unclassified cells; Oligo, oligodendrocytes; Micro, microglia; Endo, endothelia cells; Astro, astrocytes. 


much sparser. In V1, Reln-positive neurons co- 
exist with Sst and Npy, whereas in mPFC, these 
segregate as a single cluster, with ~50% co- 
marked by Ndnf; we also discovered a new in- 
hibitory subtype labeled by Lha6, which in fact 
constitutes the most abundant inhibitory sub- 
type in mPFC (Fig. 4E). Although the 5-HT(3A) 
receptor (Htr3a) expression has been reported 
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in cortical inhibitory neurons (34), Htr3a has 
not been ranked as a critical genetic marker of 
inhibitory subtypes in V1. In mPFC, however, 
we found that Htr3a distinguishably marks a 
large fraction of Vip* neurons and a subset of 
Reln* neurons (fig. S10D). 

Superficial layers (L1 to L3) were found to 
contain Vip, Reln, and Npy subtypes, whereas 


deeper layers (L5 to L6) were found to contain 
all of the inhibitory subtypes. All of the 15 cell 
types with tissue-level spatial organization could 
be reliably detected with STARmap across four 
biological replicates (fig. S11). The capability of 
STARmap for multidimensional cell typing in 
mPFC was further demonstrated in the setting 
of activity dependence, supporting the possibil- 
ity of defining cell types in part by communica- 
tion properties, including activity during behavior 
(35, 36). One hour after cocaine injection (37), a 
specific subpopulation of deep-layer excitatory 
neurons (such as Tpbg labeled eL5-2) in mPFC 
was activated compared with that in saline-injected 
control mice (Fig. 4F), revealing STARmap capa- 
bility for identifying functional segregation of 
neuronal subtypes in mPFC. 


Scaling STARmap to more than 
1000 genes 


To further test the scalability of STARmap, we 
extended our gene list from 160 to 1020 genes, 
leveraging previously published single-cell RNA 
sequencing data (24). The 1020-gene set was first 
validated in mouse hippocampal neuron culture, 
with successful resolution of neuronal and glial 
cells (fig. S12). We then probed mouse V1 neo- 
cortex with the 1020-gene set in order to evaluate 
performance in spatial cell typing in comparison 
with that of the 160-gene set. Amplicons obtained 
in the 1020-gene experiment were much denser in 
cells as compared with those in 160-gene experi- 
ments but were optically resolvable in 3D with 
high-resolution imaging and postimaging decon- 
volution (Fig. 5A). 

We observed that a higher percentage (40%) 
of amplicons were filtered out in the 1020-gene 
experiments by our error-rejection mechanism 
(fig. S3H) in comparison with the four-gene ex- 
periments (20%) (fig. S3L), indicating that a more 
frequent initial color-misassignment potentially 
resulted from amplicon merging or optical reso- 
lution and further demonstrating the importance 
of our designed error-rejection mechanism. 
Crucially, despite the read loss, we successfully 
clustered single cells of the imaging area into 
15 annotated cell types and one unclassified type 
using 1020 genes and the same data analysis pipe- 
line from the focused 160 gene probe set (Fig. 5, 
B and C, and fig. S13). Three new cell types were 
identified in addition to the 12 cell types detected 
by 160 genes (Fig. 5B): eL6 was resolved into 
two subtypes (eL6-1 and eL6-2), a previously 
uncharacterized hippocampal excitatory subtype 
(HPC) was identified, and microglial cells were 
cleanly identified with an expansion of non- 
neuronal cell type markers in the 1020-gene set. 

Beyond those advances, the 1020-gene findings 
also successfully reproduced the cell types (and 
their spatial patterning) from the 160-gene find- 
ings and further allowed discovery of multiple 
new gene markers for each cell type (for example, 
3110035E14Rik for deep layers, Cnot6l for Sst 
neurons, and CplxI for Pvalb neurons) (Fig. 5D 
and fig. $13). These molecularly defined cell 
types were highly reproducible between biolog- 
ical replicates for 1020-gene detection and were 
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Fig. 6. 3D architecture of cell types in visual 
cortex volumes. (A) Volumetric STARmapping 
via sequential SEDAL gene readout. Using a 
modified STARmap procedure (fig. S16) and 
cyclic gene readout (four genes in each cycle), 
large tissue volumes can be rapidly mapped at 
single-cell resolution without oversampling each 
amplicon. (B) Validation showing specific 
STARMAP labeling of YFP-expressing neurons 
(from transgenic Thyl::YFP mouse line) in 

3D cortical volume. Scale bar, 0.5 mm. 

(C) Representative labeling of (left) major cell 
types, (left center) layer-specific markers, 
(right center) inhibitory markers, and (right) 
activity-regulated genes acquired over 
multiple rounds in visual cortex STARmap 
volumes. (D) Per-cell expression matrix of 
28 genes from 32,845 single cells from one 
volume clustered into multiple excitatory, 
inhibitory, and non-neuronal cell types, 
z-scored across genes for each cell in order 
to normalize for mean differences in total 
signal between cells. Columns are sorted by 
order of sequencing rounds as conducted, in 
groups of four. (E) (Top) Spatial histograms of 
excitatory, inhibitory, and non-neuronal 

cell types, using same color labels as (D). 
Cells were counted in 5-um bins in a 2D 
max-projection and plotted in cell-count-per- 
micrometer units as a function of distance 
from the corpus callosum (cc) to the pia, 
averaged across the bins perpendicular to 
the cortical layers. (Bottom) Plot of 
max-projected cell locations color-coded by 
cluster as in (D). (F) Spatial distribution of 
each cell type (excitatory, inhibitory, and 
non-neuronal) and subtypes in three 
dimensions. Each dot represents a single 

cell; spatial dimensions are in micrometers. 
(G) Average nearest-neighbor distances 
computed in three dimensions between all 
excitatory cells (Excite) and each inhibitory cell 
type. For self-comparisons, the nearest neigh- 
bor was defined as the closest nonidentical cell; 
persistent self-correlation reveals self-clustering 
of inhibitory subtypes. (H) Same distances as 
(H) but using shuffled (randomized) cell-type 
labels. (I) Nearest-neighbor distances com- 
puted in three dimensions between each inhib- 
itory cell of a certain type and any member of 
the same type (Inhib > Inhib, eg VIP > VIP) 

or any excitatory neuron (Inhib > Excite). 

**** P < 0.0001, Wilcoxon rank-sum test. 


concordant with published single-cell RNA se- 
quencing results (fig. S14). We further assessed 
the possibility of scaling up STARmap to ac- 
commodate higher gene numbers; although the 
STARmap scheme can encode and decode more 
than 1 million codes and the physical volume of 
mammalian cells is not limiting for amplification 
of more than 1000 genes (fig. S15), the 1020-gene 
experiments approached the upper limit of the 
optical volume of cells (fig. SI5E); for those cases 
in which more genes are needed, STARmap 
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STARmap optimized for rapid imaging of large tissue volume 
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may cover the whole transcriptome with more se- 
quencing rounds of serial 1000-gene detection, 
or via optical resolution enhanced with super- 
resolution microscopy (38, 39) or the physical swell- 
ing typical of the hydrogel-tissue chemistries (74, 19). 


Adapting STARmap to thick tissue 
blocks for 3D analyses 


In neuroscience, addressing the 3D complexity 
of both neurons and neural circuits has gener- 
ally required the development and use of thick 
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tissue blocks or fully intact brains for functional 
and structural readouts, including electrophysio- 
logy, imaging of activity, and analysis of mor- 
phology and connectivity. Therefore, for linking 
these readout measures from intact or semi- 
intact tissue preparations with cellular-resolution 
gene expression readouts from the very same 
preparations, methods of 3D spatial transcrip- 
tomic analysis in thick tissues have long been 
sought in order to achieve datastream regis- 
tration as well as preserve 3D morphology and 
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to obtain readouts from very much larger cell 
numbers (2). The initial experiments were car- 
ried out in brain slices no more than one cell 
body thick; we therefore next developed and 
tested STARmap to overcome limitations in 
diffusional access and imaging throughput for 
large tissue volumes, with a modified strategy for 
linearly reading out gene expression at cellular 
resolution so as to enable high-throughput mo- 
lecular analysis in tissue volumes (Fig. 6A and 
fig. S16). Specificity and penetration depth of 
large-volume STARmap were tested initially by 
using Thyl::YFP mouse brains, in which STARmap 
successfully detected yellow fluorescent protein 
(YFP) mRNA across 150 um of tissue thickness 
and specifically colocalized YFP protein and mRNA 
at single-cell resolution (Fig. 6B) without label- 
ing the tens of thousands of interspersed neigh- 
boring cells. 

We then extended the spatial cell-typing of 
mouse V1 to more than 30,000 cells across 
volumes spanning all six layers and the corpus 
callosum. Using a curated gene set including 23 
cell-type markers and five ARGs read out over 
seven cycles of linear SEDAL sequencing (Fig. 6, 
C and D, and fig. S17), we applied K-means clus- 
tering of marker genes (supplementary materials, 
materials and methods) for each cell type (recov- 
ering 11 cell types corresponding to the majority 
of those extracted by the 160-gene experiment). 
We found that 3D patterning of the 11 cell types 
(Fig. 6, E and F) was consistent with the 160- 
gene thin-section tissue findings but provided 
an accurate and quantitative profiling of cellular 
distribution across space, with much larger cell 
numbers. As reflected by both spatial-histogram 
(Fig. 6E) and correlational analyses (fig. S17B), 
excitatory subtypes exhibited a layered gradient 
distribution, with the spatial density of each sub- 
type decaying across space into adjacent layers. 
By contrast, inhibitory subtypes were dispersed, 
albeit with layer preferences exhibited by the Vip 
subtype (largely located in layer 2/3) and the Sst 
and Pvalb subtypes (in layers 4 and 5). Non- 
neuronal cells were largely seen in layer 1 and 
white matter. 

To discover yet-finer volumetric patterns, we 
further analyzed the distribution of distances 
from each individual cell from each sequencing- 
defined subtype to its nearest neighbors, find- 
ing unexpectedly that the nearest neighbor of 
any inhibitory neuron tended to be its own 
subtype, rather than excitatory neurons or other 
inhibitory subtypes (Fig. 6G). If inhibitory neu- 
rons were randomly dispersed among the more 
abundant excitatory neurons in a purely salt- 
and-pepper distribution, the distance between 
inhibitory neurons would be larger than that 
from inhibitory to excitatory neurons (Fig. 6H). 
Instead, the actual intrasubtype distance of in- 
hibitory neurons was much shorter (~15 um, 
which is equivalent to the size of a single neu- 
ron, indicating direct somatic juxtaposition) 
(Fig. 61), revealing a short-range self-clustering 
organization of inhibitory subtypes across vol- 
umes that could only be accurately measured in 
three but not in two dimensions (fig. SI8A). When 
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guided by this initial STARmap observation, 
evidence for such patterning could be also ob- 
tained in transgenic mouse lines (fig. S18, B and 
C). This discovery bears considerable relevance to 
previous functional work; for example, electro- 
physiological studies have suggested that inhib- 
itory neurons in spatial proximity tend to be 
connected by electric (gap) junctions important 
for setting up synchronized firing patterns 
(40, 41), and in vivo imaging has suggested that 
inhibitory-neuron groupings in visual cortex could 
sharpen visual responses (42). 


Discussion 


STARmap defines a platform for 3D in situ tran- 
scriptomics, enabled by state-of-the-art DNA li- 
brary preparation and sequencing in an HTC 
formulation. Here, STARmap was shown to be 
applicable to the study of molecularly defined 
cell types and activity-regulated gene expression 
in mouse cortex and to be scalable to larger 3D 
tissue blocks so as to visualize short- and long- 
range spatial organization of cortical neurons 
on a volumetric scale not previously accessible. 
In future work, STARmap may also be adapted 
to longer sequencing lengths or higher gene num- 
bers; there is no intrinsic limit to the number of 
genes or RNA species that can be simultaneously 
and quantitatively accessed with STARmap (fig. 
$15); STARmap may also be capable of integrat- 
ing cell-type information with single-neuron mor- 
phology and projection anatomy (for example, by 
means of Brainbow and MAPseq) (43, 44) as well 
as with in vivo neural activity imaging and 
electrophysiology. This platform can also be gen- 
eralized to study other heterogeneous cell popula- 
tions in diverse tissues across the body, although 
the brain poses special challenges well suited to 
STARmap analysis. For example, the polymor- 
phic ARG expression observed across different 
cell types is likely to depend on both intrinsic 
cell-biological properties (such as signal trans- 
duction pathway-component expression) and 
on extrinsic properties such as neural circuit 
anatomy that routes external sensory informa- 
tion to different cells (here, in visual cortex). In 
general, it may not be possible to fully define 
brain cell typology independent of such 3D anat- 
omy as well as activity patterns exhibited and 
experienced by cells during behavior; the nature 
of input and output communication pathways for 
the cells in question in fact can form the founda- 
tion for defining cell types (35, 36). Toward 
this end, in situ transcriptomics exemplified 
by STARmap can effectively link this imaging- 
based molecular information with complementary 
cellular-resolution datastreams describing anat- 
omy, natural activity, and causal importance, thus 
promising to fundamentally deepen our under- 
standing of brain function and dysfunction (2). 


Methods summary 


All animal procedures followed animal care 
guidelines approved by Stanford University’s 
Administrative Panel on Laboratory Animal Care 
(APLAC) and guidelines of the National Institutes 
of Health. For thin sections, animals were anes- 


thetized and rapidly decapitated; the brain tis- 
sues were sliced by use of a cryostat. For thick 
sections, animals were anesthetized and trans- 
cardially perfused with paraformaldehyde; the 
brain tissues were sliced by use of a vibratome. 
In STARmap experiments, tissues were hybridized 
with SNAIL probes, enzymatically amplified, 
hydrogel embedded, and sequentially imaged 
by using the SEDAL process and a confocal mi- 
croscope. The resulting image datasets were 
registered across multiple cycles by using the 
positions of all amplicons in each cycle and 
decoded. For cell typing and single-cell gene 
expression analyses, the amplicons were attrib- 
uted to individual cells based on segmentation 
images of fluorescent Nissl staining. All the 
detailed procedures for the experiments and data 
analyses are described in the supplementary 
materials. 
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Teaching an old carbocation new 
tricks: Intermolecular C-H insertion 
reactions of vinyl cations 


Stasik Popov’, Brian Shao’, Alex L. Bagdasarian’, Tyler R. Benton’, Luyi Zou”, 
Zhongyue Yang", K. N. Houk"{, Hosea M. Nelson" 


Vinyl carbocations have been the subject of extensive experimental and theoretical 
studies over the past five decades. Despite this long history in chemistry, the utility of vinyl 
cations in chemical synthesis has been limited, with most reactivity studies focusing on 
solvolysis reactions or intramolecular processes. Here we report synthetic and mechanistic 
studies of vinyl cations generated through silylium—weakly coordinating anion catalysis. 
We find that these reactive intermediates undergo mild intermolecular carbon-carbon 
bond-forming reactions, including carbon-hydrogen (C-H) insertion into unactivated sp? 
C-H bonds and reductive Friedel-Crafts reactions with arenes. Moreover, we conducted 
computational studies of these alkane C—-H functionalization reactions and discovered that 
they proceed through nonclassical, ambimodal transition structures. This reaction 
manifold provides a framework for the catalytic functionalization of hydrocarbons using 


simple ketone derivatives. 


or more than a century, carbocations have 
played a central role in the chemical sci- 
ences, inspiring the development of broadly 
applied chemical reactions and a greater 
understanding of the fundamental prop- 
erties of molecules (1). This is pointedly true for 


Fig. 1. C-C bond-forming reactions 
of dicoordinated carbocations. 

(A) Phenyl cation insertion into sp? 
C-H bonds. (B) Initial proposal 

of carbocation rebound mechanism 
in ring-contraction reactions 

of cyclononeny!I triflates. (©) Proposed 
catalytic cycle for reductive 

alkane alkylation. (D) Identification 

of cyclohexeny| triflate as a 
persistent vinyl cation precursor 
owing to the prohibitive barrier 

to 1,2-hydride shift. Calculations 
performed at MO62X/6-311+G(d,p) 
level of theory; [WCA] omitted 

for clarity. R, alkyl; R}, aryl, 

alkyl, halide, or silyl ether; R°, 

alkyl or aryl; cat., catalytic; TMS, 
trimethylsilyl; OTf, trifluoromethanesulfonate; 
Et, ethyl; EtOH, ethanol; AG, 

Gibbs free energy; AG?, Gibbs 

free energy of activation. 


Popov et al., Science 361, 381-387 (2018) 
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vinyl cations. Beginning with Jacobs’ proposal 
of a vinyl cation intermediate (2) and continuing 
through the investigations by Grob et ai. (3), 
Hanack (4), Rappoport and Gal (5), Schleyer and 
others (6), Stang and Rappoport (7), and others 
(8-1), this class of reactive intermediates has 


A C-H insertion of putative phenyl cations 


H 
Qo 
~ TMS c 
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3 s Q 
WCA cl 
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oy) Et,SiOTf 
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Et,Si-H 
H [wear WwcAl- 
: @aH [wea] 
Co" 
nd 
1 1,2-hydride 10 


shift 


been the subject of extensive theoretical and ex- 
perimental studies. The structure and bonding 
of these cations has fascinated theorists, driving 
the development of computational techniques 
over the past five decades (12-75). Vinyl cation 
intermediates have also been implicated in mod- 
ern transition metal-catalyzed processes (16) and 
in classical acid-catalyzed reactions (17). Despite 
this rich history and broad scientific relevance, 
most reactivity studies have focused on solvolysis 
reactions where the reactive vinyl cation is inter- 
cepted by heteroatom-containing solvent mol- 
ecules (18, 19). 

Recently, our group reported that putative 
phenyl cations (1; Fig. 1A), a related class of 
dicoordinated carbocations, undergo insertion 
into sp® C-H bonds (20). Inspired by this result, 
we hypothesized that vinyl cations may display 
analogous C-H insertion reactivity. Challenging 
this hypothesis was a lack of substantial precedent 
for this mode of reactivity, despite the extensive 
experimental studies of vinyl cations over the past 
five decades. However, a few examples of vinyl 
cations engaging in C-C bond-forming reactions 
have been reported. Hanack and others reported 
the formation of fused products 2 and 3 from 
the solvolysis of cyclononeny] triflate 4 (Fig. 1B) 
(21, 22). In this reaction, and in analogous reports 


‘Department of Chemistry and Biochemistry, University of 
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*Laboratory of Theoretical and Computational Chemistry, 
Institute of Theoretical Chemistry, Jilin University, 
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*Present address: Department of Chemical Engineering, Massachusetts 
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from Olah and Mayr (23) and Caple and others 
(24), this unusual C-C bond-forming process was 
rationalized via a carbocation rebound mech- 
anism. Here the vinyl cation 5 is quenched via 
1,5-hydride shift, and the neutralized alkene (6) 
attacks the newly formed alkyl carbocation. This 
sequence forges the ring fusion of the bicyclononane 
cation 7, leading to products 2 and 3. The inser- 
tion of vinyl cations into methane C-H bonds has 
also been observed in the gas phase (25). More 
recently, Metzger and others (26) and Brewer and 
others (27) have independently proposed the 
intramolecular C-H insertion of vinyl cations 
generated from alkynes or diazo compounds in 
solution. 

The solvolysis studies described above relied 
on polar solvents to facilitate thermal ionization 
of vinyl species bearing halide or pseudohalide 
leaving groups (for example, Fig. 1B). This ap- 
proach almost invariably leads to attack of solvent 
on the cationic carbon center, leaving reactivity 
beyond the Syl manifold largely undiscovered. We 
drew on our phenyl cation studies to overcome 
this limitation by using silylium-monocarba-closo- 
dodecaborate salts (formed in situ through the 
reaction of triethylsilane with [Ph3C]’THCBy,ClyT ) 
in hydrophobic solvents to generate vinyl cations. 
Here the nonnucleophilic and nonbasic properties 
of weakly coordinating anions (WCA), such as 
undecachlorinated monocarba-closo-dodecaborate 
anion (HCB,,Cly ) (28), would enhance the Lewis 
acidity of the cationic silicon center, allowing for 
mild ionization of alkene derivatives. In this sce- 
nario, silylium-mediated ionization of a vinyl tri- 
flate (8) would generate a kinetically persistent 
vinyl cation-WCA ion pair (9; Fig. 1C). Insertion 
of this reactive dicoordinated cation (9) into an 
alkane C-H bond would lead to formation of 
alkyl carbocation 10. A 1,2-hydride shift would 
lead to the more stable tertiary cation (11) that, 
upon reduction by a sacrificial silane (29), would 
generate the functionalized hydrocarbon product 
12 and regenerate the silylium-carborane initia- 
tor. In its entirety, this process would enable the 
direct C-H alkylation of alkanes and arenes by 
simple ketone derivatives. We now report the 
successful execution of this mechanistic hypoth- 
esis, entailing intermolecular, reductive C-H al- 
kylation mediated by silane and catalytic quantities 
of carborane salts. Moreover, we describe the un- 
expected discovery of an ambimodal transition 
state and post-transition state bifurcation in this 
reaction that leads to several products directly 
through a nonclassical mechanism (30-32). Ambi- 
modal transition states have characteristics of two 
or more classical transition states, just as non- 
classical ions simultaneously possess the features 
of two classical ions. 


Vinyl cation generation from 
cyclohexeny| triflates 


Given the known propensity of substituted cyclic 
vinyl cations to undergo rapid intramolecular 
rearrangements (33), we identified prospective 
cation precursors through density functional 
theory calculations. First, we verified that a 1,2- 


hydride shift in cyclohexenyl cation 9, which 
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Table 1. Reductive alkylation of cyclic vinyl triflates. Reactions performed at 0.1 M in 
hydrocarbon solvent. *Yield determined by GC-FID with nonane as an internal standard. tlsolated 


yield. R4, alkyl. 
[PhsC]*[HCB,,Cl,,J- (2 mol%) 
om EtsSiH (1.5 equiv) ole 
R! + alkyl—H R! 
n=1or3 See et n 
Entry Substrate Solvent Product Yield (%), Time (h) 
OTf 
1 or CgHi2 87*, 1.5 
8 14 
OTf 
‘ H 
OTf mab 
3 Cy n-CsHjo I \u 68", 1.5 
8 21:36:11 (a:B:y) 
83t, 3 
4 C,H , 
eat (15:1 air) 
OTf 
5 eo CeHio Ee 91", 1 


would generate the more stable cyclic allyl cat- 
ion 138, is kinetically unfavorable (Fig. 1D). The 
calculated high barrier, despite the impressive 
exothermicity of the rearrangement, results from 
the geometrical distortion required to achieve 
overlap of the migrating H 1s orbital with the 
nearly orthogonal orbitals at the origin and 
terminus of the migration. On the basis of this 
computational result, we initiated our studies 
with cyclohexeny] triflate (8). We were pleased 
to find that exposure of cyclohexeny] triflate (8) 
to 1.5 equivalents of triethylsilane and 2 mole % 
(mol %) [PhsC]*THCB,,Cl,] in dried cyclohexane 
solvent at 30°C in a nitrogen-filled glove box re- 
sulted in the formation of cyclohexylcyclohexane 
(14s Table 1, entry 1) in 87% yield, after 90 min. 
Intrigued by the remarkably mild conditions 
used in this alkane alkylation reaction, we un- 
dertook a brief study of scope to further eluci- 
date potential synthetic applications and to gain 
mechanistic insight. Cycloheptane and m-pentane 
also reacted efficiently with the cyclohexenyl 
cation, albeit with poor regioselectivity in the 
latter case (Table 1, entries 2 and 3). Although 
cyclohexenyl triflates bearing substituents at 
the carbons in the 2- or 6-position led to com- 
plex mixtures of products, presumably the result 


of nonproductive unimolecular decompositions 
(33), other positions of the cyclohexenyl ring 
were tolerant of substitution. For example, ex- 
posure of the enol triflate derived from 5a- 
cholestan-3-one (Table 1, entry 4) to our reaction 
conditions led to formation of the alkylated 
steroid 15 in 88% yield and 15:1 diastereomeric 
ratio (d.r.) (relative stereochemistry determined 
by x-ray crystallography). Analogous to the pre- 
viously reported ring-contraction reactions of 
medium-sized cyclic vinyl triflates (27), expo- 
sure of cyclooctenyl triflate to our optimized 
reaction conditions led to rapid transannular 
C-H insertion to yield bicyclo[3.3.0]octane 
(Table 1, entry 5). 


Evidence of a nonclassical mechanism 


To probe the possibility of a hydride-abstraction, 
carbocation-rebound mechanism as proposed 
in early examples (22) (Fig. 1B), we studied the 
insertion reactions of unsymmetrical cyclohex- 
enyl triflate derivatives 16 and 17 (Fig. 2A). Ifa 
stepwise rebound mechanism were operative, we 
would expect 4,4-dimethylcyclohexeny] triflate 
(16) and 5,5-dimethylcyclohexeny] triflate (17) 
to yield identical product distributions, as they 
would both proceed through alkene intermediate 
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18 (Fig. 2A). In the event, we found that exposure 
of triflate 16 to our reaction conditions led to 
formation of Cl-cyclohexylated product 19 
in 68% yield and C2-cyclohexylated product 
20 in 6% yield. However, triflate 17, the 5,5- 
disubstituted analog, provided C1-cyclohexylated 
product 20 in 78% yield and C2-cyclohexylated 
product 19 in 6% yield. Although the C-H in- 


A 
: mixture 
+ Of —ee------- ‘ 
‘19 & 20 
| g° 
: common : : 
: alkene: 
‘ intermedia te; ‘ 


non-operative carbocation rebound 


+ 


sertion event occurred primarily at the Cl that 
originally bore the triflate group, in both sys- 
tems, minor products were observed that corre- 
spond to insertion at the C2 adjacent to the triflate 
(>10:1 C1:C2 insertion in both cases). Although 
these experimental results support a nonre- 
bound, concerted C-H insertion mechanism, 
they do not provide mechanistic insight into 


the formation of C2-alkylation products. There- 
fore, we undertook quantum mechanical calcu- 
lations and molecular dynamics (MD) simulations 
to probe the mechanism in more detail (movies 
S1 to S4). 

Calculations on the reaction of the cyclohexen- 
1-yl (9) cation with cyclohexane gives a very early 
ambimodal transition structure (TS), wherein the 


Experimental ratio MD ratio 


11:1 (19:20) 50:7 (19:20) 


“- 3 1:13 (19:20) 9:46 (19:20) 


OTf 
11:1 19:20 
2 _ selectivity 2 Or 
16 
joe ae T 6% ve T 
OTf 13:1 20:19 
_ Selectivity’ — 
17 


78% seer tT 6% ee t 


path 1 (major) 
direct 1,1-insertion 


O major 
HOA 

Oo. 

DT3 minor 


{ 1,2-H shift 
a 


INTi1, 


2» 1,2-H shift 
———__» 


1,2-H shift, 2-H shift. 
H 


INTi1, 


1,2-H shift, ie 


INTi78 


INTi4B 


Fig. 2. Discovery of an ambimodal transition state. (A) Probing the 
possibility of a rebound mechanism. Reactions were performed under the 
same conditions as described in Table 1. (B) Snapshots from MD 
simulations representing structures taken from multiple trajectories 
starting from the ambimodal TSg.-cy, illustrating the most common 
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reaction pathways, the post-transition state bifurcation, and resulting 
products. (C) Line drawing representations of the three-dimensional 
representations in (B). TYield determined by gas chromatography-flame 
ionization detector (GC-FID) with nonane as an internal standard. Calcu- 
lations performed at MO62X/6-311+G(d,p) level of theory. Cy, cyclohexyl. 
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geminal cyclohexane C-H bonds interact with 
the vinyl cation vacant orbital (TS9_cy, Fig. 2B). 
Quasi-classical MD trajectories were initiated from 
normal mode-sampled structures (a Boltzmann 
distribution) in the region of transition struc- 
ture TS9_cy. Taken together, these represent the 
ambimodal transition state, a single saddle-point 
region that can give rise to several different pro- 
ducts (30-32). Two typical trajectories are sum- 
marized in Fig. 2B, with several snapshots of 
structures illustrating the bifurcation of the re- 
action pathway. The common trajectory path from 
ambimodal TS9_cy begins with hydride transfer 
from cyclohexane to the vinyl cation via DT, 
(where DT stands for dynamical transient, not an 
intermediate or transition state) (Fig. 2B). Unlike 
arebound mechanism, the cyclohexene and cyclo- 
hexyl cation moieties fail to separate; the newly 
formed cyclohexyl cation coordinates (DT3) with 
the alkene of the neutralized cyclohexene, in most 
trajectories, at the carbon that previously bore 
the cation (C1) (Fig. 2B). The resulting bridging 
nonclassical species (DT3) continues to different 
products through different trajectories (Fig. 2B). 
DTz3 is an unsymmetrical nonclassical carboca- 
tion, with the cyclohexyl bound more strongly 
at Cl, but still provides the three-center two- 
electron hypervalent bonding (very strong hyper- 
conjugation) to stabilize the carbocation. In most 
of the trajectories, DT3 collapses to the classical 
secondary carbocation structure, DT,, that un- 
dergoes a barrierless 1,2-hydride shift to form the 
tertiary cyclohexylcyclohexyl cation as a poten- 
tial energy minimum at —40.9 kcal/mol (INT,,,) 
(movies S2 and S4). In fewer trajectories, DT; 
goes on to the opposite asymmetric nonclassical 
species DT;, favoring C2 functionalization (Fig. 
2B). DT; leads to DTg and eventually INT. 
(movies S1 and S3). Through the stabilizing ef- 
fects and geometric constraints (fig. S40) (34) 
of these persistent nonclassical intermediates, 
energetically favorable hydride shifts are delayed, 
allowing for the experimentally observed dynamic 
distribution of products in our ring-substituted 
cyclohexen-1-yl cation substrates (Fig. 2A). The 
free energies of various stationary points on the 
surface between ambimodal TS9.cy and final 
stable carbocation INT}, show that minima are 
very shallow and are, therefore, not equilibrated 
during the MD trajectories (fig. S39) (34). 


Reactivity of acyclic vinyl triflates 


Having established the catalytic, intermolecular 
C-H alkylation reactions of cyclic vinyl triflates, 
and having come to a deeper understanding of 
the nonclassical reactivity of cyclic vinyl cations, 
we extended our investigations to the reactivities 
of acyclic vinyl triflates. Acyclic vinyl cations have 
constituted most of the previous experimental 
and theoretical studies of dicoordinated cations, 
as cyclic variants have traditionally proven diffi- 
cult to generate under solvolytic conditions (35). 
Moreover, depending on substitution, several 
studies support nonclassical ground-state bond- 
ing in vinyl cations, structures reminiscent of 
protonated alkynes (36-38). Subjection of bute- 


nyl triflate 21 to the reaction conditions led to 
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Table 2. Reactions of acyclic vinyl triflates and calculated nonclassical INT23-22/24 for 
insertion of cyclohexane into butenyl cation. *Reaction performed at 0.1 M and 30°C, with 
triisopropylsilane used instead of triethylsilane. {Reaction performed at 0.005 M and 30°C. 
Reaction performed at 0.1 M and 70°C. §Reaction performed at 0.008 M (1:1 v:v), 5 mol % catalyst 
loading, and —40°C. |lYield determined by GC-FID with nonane as an internal standard. R, alkyl or H; 


R}, alkyl or H. 
OTf [Ph3C]*[HCB,,Cl,,/- (2 mol%) cy 
da [ ) Et3SiH (1.5 equiv) agi 
& 1 + R 
, 1 ° H —40 to 70 °C 8 : Cc 
e ° A Bae 
Entry Substrate Solvent Temp. (°C) Product 
t oS ie CeHi2 30 a es 85! 
OTf Cy 
21 
25:1 EZ ds 
ot Ne CeHi2 30 ae . acy 40, 39!! 
OTf Cy 
23 22 24 
$ ow I] 
3 FZ ~ott CeHi2 70 i a ea ar 16, 19 
25 Cy 
dei EZ 
: 22 24 
4s “NF CHelyCeHys 40 16 ee el 
OTf y 
23 22 24 


unsymmetrically bridged 
nonclassical ion 


high-yielding reductive alkylation of cyclohexane, 
providing 2-cyclohexylbutane (22) in 85% yield 
(Table 2, entry 1). We found that the treatment 
of a cyclohexane solution of but-1-en-2-yl triflate 
(23) with 2 mol % of [Ph3C]*THCB,Cly] and 
1.5 equivalents of triethylsilane led to formation 
of a 1:1 mixture of 1-cyclohexylbutane (24) and 
2-cyclohexylbutane (22) (Table 2, entry 2). More- 
over, use of a 1:1 mixture of CgDyo:CgHy in this 
system led to exclusive formation of D,. and Do 
isomers, without any detectable amount of cross- 
over products (that is, CygpHg.nDy2_, OF CigH29-»Dp, 
n # 0) that would be expected in a stepwise 
process (fig. S24) (34, 39). Use of terminal tri- 
flate 25 (Table 2, entry 3) resulted in an identical 
product distribution to that of entry 2, albeit re- 


hydride approach 


hydride approach 


leading to 22 leading to 24 


quiring extended reaction times. Although pri- 
mary vinyl cations are difficult to generate under 
solvolytic conditions (77), here the abstraction of 
the terminal triflate by silylium cation may provide 
a lower-energy transition through favorable Si-O 
bond formation. Moreover, a favorable pathway 
for a concerted triflate abstraction and hydride 
migration leading directly to a secondary vinyl 
cation has been calculated (fig. S43). Carrying out 
the reaction at -40°C in chloroform solvent al- 
lowed for selective formation of 1-cyclohexylbutane 
(24) (about 2:1) (Table 2, entry 4). These results 
are consistent with our view of a nonclassical 
intermediate (Table 2, INT 3399/24) (movie S5), 
where hydride delivery from silane determines pro- 
duct distributions. In the absence of coordinating 
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solvent (entries 1 and 2), hyperconjugative sta- 
bilization may lead to a greater degree of bridg- 
ing. As we increase the amount of chloroform 
solvent to attenuate hyperconjugative effects, 
stabilization of a more classical 2° carbocation, 
instead of the energetically prohibitive 1° carbo- 


cation, would lead to internal Si-H quenching 
and the terminal product 24. The important 
observation of modulated product distributions 
in polar solvent (entry 4) implies that the pro- 
duct ratios of reactions that proceed through 
unsymmetrical nonclassical species can be dic- 


A [wea]? 
OTf a CeHi1 GH 
[Ph3C]*[HCB 14Cl,4] (2 mol%) D ert 
EtsSi-D (1.5 equiv) a 
8 erli2 
B 
5 LWOAl © [wea]® 
H @.¢,D Ph HaA@4,D 
Ph on 1,2-insertion ee 1, 1-insertion a 
ee es eee ree > 
C.D 12 CED 42 
27 
c p,100% 
Ph TT 
OTF [PhsC}*[HCB,,Cl,,]~ (2 mol%) 5 
. se H/D 35% 
Et3Si-H(1.5 equiv) H/D 65% 
26 CeD12 
28 
[woa]® [wea] 
Ph Ha@4,D 
a@ 1,1-insertion ote oe. 1,2-D 
—— > __Iph 
CeDi2 
27 
1,2-H 
D 
75% 
25% ons [wea]® 
= —_ D/H D Ha arePn 
TT [Ph3C]*[HCB,;Cl,,]~ (2 mol%) Ph CDi a. Poe 
eS le 
Et3Si-D (1.5 equiv) 
26 CeDi2 32 33 
E 12% 
88% wp [wea]? 
D OTf H/D H D>» aren 
a [PhgC]*[HCB,4Cly,J- (2 mol%) Ph CH on al 
— vila: 
Et3Si-H (1.5 equiv) 
34 CeHy2 35 36 


Fig. 3. Deuterium labeling studies of cyclohexeny| triflate derivatives. (A) Deuterium labeling 
experiment to support charge propagation step. (B) Experimental design to probe 1,1- versus 
1,2-C-H insertion. (© to E) Mechanistic experiments suggesting benzylic 1,2-hydride and deuteride 


shifts of the resulting bridged cation. Ph, phenyl. 
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tated by experimental factors that influence the 
degree of bridging. 


Isotopic labeling studies 


We also carried out deuterium labeling studies 
to further probe the mechanism. First, competition 
experiments with a 1:1 mixture of CgHyo:CgD 
provided a 1:1 mixture of cyclohexylcyclohexane-dp 
and cyclohexylcyclohexane-d,, supporting a con- 
certed reaction pathway with C-H insertion being 
a diffusion-controlled event (fig. S22). The use 
of triethylsilane-d, with cyclohexane and cyclo- 
hexeny] triflate (8) resulted in high levels of 
deuterium incorporation at the tertiary carbon, 
supporting the formation of a persistent 3° 
carbocation (Figs. 1C and 3A). However, this 
experiment does not differentiate a 1,1- from 
a 1,2-insertion mechanism; the 2° carbocation 
(10) formed via 1,1 insertion will always pro- 
ceed to the more stable 3° carbocation 11 (Fig. 
1C). To tease apart these different C-H insertion 
mechanisms, we prepared 3-phenylcyclohexenyl 
triflate 26 in the hope that migration of the 
benzylic C3 hydride of carbocation intermediate 
27 would be competitive with Cl-deuteride mi- 
gration, allowing for direct experimental evidence 
of a 1,1-insertion event (Fig. 3B). Upon exposure 
of triflate 26 to our reaction conditions, using 
cyclohexane-d;z solvent and triethylsilane, we 
observed 67% deuterium incorporation at C2, 
65% at Cl, and 35% at C4 (Fig. 3C). In total, we 
observed 1.6 deuterides incorporated at these 
three carbons (C1, C2, and C4), accounting for 
80% of the two cyclohexane-d,, deuterides prone 
to migration. Here we posit that following 1,1 
insertion, C3-hydride migration produces ben- 
zylic cation 29, whereas Cl-deuteride migration 
leads to tertiary C1 and C4 carbocations 30 and 
31. All three of these species (29 to 31) can be 
quenched by Si-H reduction. Further confirm- 
ing migration of the C3-benzylic H, using CgD,2 
and triethylsilane-d,, we observed 25% deuterium 
incorporation at C3 and 75% at C2 (32, Fig. 3D). 
Here D incorporation at C3 supports the forma- 
tion of a benzylic carbocation that results from 
1,2-hydride migration to quench a transient non- 
classical bridged cation (33). Moreover, use of 
C3-deuterated substrate 34 (Fig. 3E) with CeHy» 
leads to D incorporation at C2 (35, about 12%), 
further supporting the migration of the benzylic 
deuteride to quench the carbocation (36) gen- 
erated upon insertion. The lower levels of C2- 
deuteride incorporation in product 35 can be 
attributed to kinetic isotope effects, as the C3 
deuteride (36) will migrate slower than the C3 
hydride in cation 33, whereas the C1 hydride 
of cation 36 will migrate faster in this system 
than the deuteride of transient 33 (40). The 
fact that less migration of the benzylic C3 hy- 
dride is observed over the tertiary Cl hydride 
supports the nonclassical nature of the reac- 
tion, where a discrete classical secondary cation 
is not formed (Fig. 3D). 


Reactivity of vinyl cations with arenes 


Having established that vinyl triflates are com- 
petent vinyl cation precursors under silylium 
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[Ph3C]*[HCB,,Clj4f° (2 mol%) 
Et3SiH (1.5 equiv) 


—40 to 70 °C 


R! R! 
or 
hp) 
R2 n 


entry 1, 74% yield*t 
4 equiv 


entry 2, 49% yield*§ 


Cl 
Cl 
H 


entry 3 


10 equiv 56% yield (47:9)§ 


arene functionalization in pentane solvent 


entry 5 
51% yield (43:8)§ 


Bu” ; 


entry 9 
49% yield (9:1 d.r.)§ 


rad 


entry 13 
57% yieldt§ 


one 


entry 6 entry 7 
61% yield§ 78% yield§ 
entry 11 
90% yield (8:1 d.r)§ 64% yield§ 
O 14 entry 15 
43% yield§ 77% yield§ 


arene functionalization in chloroform solvent 


oo yy 


entry 17, 66% yield (47:19)§ entry 18, 80% yield§ 


10 equiv 


Fig. 4. Reductive arylation of vinyl triflates. *Reaction performed with 
1.2 equivalents triethylsilane. {Triisopropylsilane used instead of triethyl- 
silane. Yield determined by GC-FID, with nonane as an internal standard. 


catalysis conditions and that these reactive in- 
termediates undergo efficient sp? C-H function- 
alization reactions, we sought to investigate their 
reactivity with arenes. It has been reported that 
cyclic vinyl triflates are poor electrophiles in 
Friedel-Crafts arylation reactions (41). This find- 
ing has been attributed to the poor electrophilicity 
of vinyl cations (75) and to the slow ionization 
of vinyl cation precursors in arene solvents (41). 
In early studies, cyclohexeny] triflates were com- 
pletely unreactive toward arene nucleophiles, 
whereas triflates with larger ring sizes partici- 
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a2 


10 equiv 50 equiv 


ethyl; t-Bu, tert-butyl. 


pated in Friedel-Crafts reactions at increased 
temperatures, albeit in poor yields (47). We 
posited that use of silylium-carborane salts 
would allow for mild ionization of cyclic vinyl 
triflates in nonpolar solvents, allowing for facile 
Friedel-Crafts arylation reactions. We were pleased 
to find that with four equivalents of benzene in 
pentane solvent (1:22 molar ratio of benzene: 
pentane), cyclohexeny] triflate (8) underwent 
smooth reductive arylation to yield phenylcyclo- 
hexane in 79% yield in 2 hours at room tem- 
perature (Fig. 4). In this reaction, the mass 


entry 19, 95% yieldtt 


entry 4 
51% yield§ 


WP 


entry 8 
88% yield§ 


om 


entry 12 
71% yields 


“OO 


entry 16 
51% yield*$ 


F. 
Ap 
entry 20, 46% yieldt§ 
10 equiv 


8Yield determined by nuclear magnetic resonance with an internal 
standard. R, alkyl, aryl, or H; R*, alkyl, halide, or H; R?, phenyl or 


balance is composed of cyclohexylpentane iso- 
mers. Although we posit that triflate abstrac- 
tion is rate limiting, we hypothesize that the 
product-determining step is the C-C bond- 
forming event, with the barrier for attack of 
the arene a-system falling nearly 20 kcal/mol 
below the barrier for C-H insertion (fig. S44). 
In addition to benzene, electron-poor haloarenes 
such as difluorobenzene and dichlorobenzene 
underwent smooth C-C bond formation to yield 
cyclohexylated haloarenes in synthetically useful 
yields. Likewise, electron-rich arenes, such as 
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mesitylene, were competent nucleophiles. Cyclo- 
hexenyl triflates bearing substituents at the 4- 
or 5-position could also be arylated, including 
the enol triflate derived from 5a-cholestan-3- 
one, which yielded an arylated steroid core in 
90% yield and 8:1 dr. Various ring sizes were 
also competent under these reaction conditions, 
with cyclopenteny]l triflate and cyclohepteny] tri- 
flate undergoing smooth reductive alkylation 
with benzene reaction partners in 64 and 71% 
yields, respectively. Cyclobuteny]l triflate partici- 
pated in this reductive Friedel-Crafts alkylation, 
as did aromatic alkenes. The triflate derived from 
o-tetralone was reductively arylated in 43% yield, 
and acetophenone-derived acyclic triflates were 
also arylated in 51 to 77% yield (Fig. 4, entries 15 
and 16). Simple acyclic vinyl triflates were com- 
petent electrophiles for arylation by both electron- 
poor and electron-rich arenes, requiring as little 
as 10 equivalents of arene in chloroform solvent 
at —40°C (Fig. 4, entries 17 to 20). 


Outlook 


We have shown that vinyl cations, the subject 
of numerous computational and experimental 
studies, are now accessible synthetically from 
simple vinyl triflates using WCA salts under mild 
conditions. The nonnucleophilic nature of the 
WCA allows these unstabilized vinyl cations to 
engage in C-C bond-forming reactions with 
alkanes and a variety of arenes, modes of re- 
activity that have been largely unreported despite 
extensive previous work. We find that the C-H 
insertion reactions of vinyl cations proceed through 
mechanisms that feature post-transition state 
bifurcation and nonclassical ions, mechanistic 
features that are common in terpene biosynthesis 
(32) but rarely found in synthetic methodology. 
These findings lay the conceptual and experimen- 
tal groundwork for further discoveries in the field 
of alkane C-H bond functionalization using ke- 


tone derivatives and WCA catalysis. 
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An efficient way to reduce the power consumption of electronic devices is to lower the 
supply voltage, but this voltage is restricted by the thermionic limit of subthreshold 
swing (SS), 60 millivolts per decade, in field-effect transistors (FETs). We show that a 
graphene Dirac source (DS) with a much narrower electron density distribution around 
the Fermi level than that of conventional FETs can lower SS. A DS-FET with a carbon 
nanotube channel provided an average SS of 40 millivolts per decade over four decades of 
current at room temperature and high device current I¢9 of up to 40 microamperes per 
micrometer at 60 millivolts per decade. When compared with state-of-the-art silicon 
14-nanometer node FETs, a similar on-state current /,,, is realized but at a much lower 
supply voltage of 0.5 volts (versus 0.7 volts for silicon) and a much steeper SS below 


35 millivolts per decade in the off-state. 


he need for extended time of operation for 
devices powered by batteries has changed 

the electronic industry (-4). Minimal power 
consumption may be accomplished by re- 
ducing the supply voltage Vpp and by de- 
signing transistors with steeper subthreshold 
swing (SS) so that they could be switched from 
off-state to on-state at faster rates (5-7). How- 
ever, conventional Si complementary metal oxide 
semiconductor (CMOS) field-effect transistors 
(FETs) are constrained to a 60 mV/decade min- 
imum constraint on SS at room temperature (2) 
because in a typical n-type FET, electrons in the 
source are prevented from flowing to the drain 
by a potential barrier (Fig. LA) that can be tuned 
by the gate voltage. The electrons in the source 
have an energy distribution (usually a thermal 
Boltzmann distribution) that spreads to the val- 
ues exceeding the potential barrier created by 
the gate, which sets a 60 mV/decade limit on SS. 
Scaling Vpp < 0.5 V while maintaining a high 
on-state current J,,, and low off-state current Io¢¢ 
will require transistors with a different oper- 
ation mechanism. These devices include tunnel 
transistors (T-FETs) (6, 7), impact ionization FETs 
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(i-FETs) (8), and positive feedback and negative 
gate capacitance FETs (NC-FETs) (9-11). T-FETs 
have emerged as the most promising alternative, 
with the potential to overcome the thermally 
limited SS of 60 mV/decade by using a supply 
voltage below 0.5 V, thereby offering substantial 
power dissipation savings. However, all fabricated 
T-FETs have severely limited on-state current 
(typically lower than 10 1A/um), and the steepest 
switching slope SS of sub-60 mV/decade is usu- 
ally not sustained across the entire subthreshold 
region (12-16). Other kinds of sub-60 mV/decade 
transistors also suffer from vital drawbacks such 
as large hysteresis (70) and limited speed, stabil- 
ity, or voltage scalability (8, 17). 

We now consider introducing a “Dirac source” 
(DS) for the conventional FET, which is a “cold 
electron source” without a long thermal tail above 
the potential barrier in the channel. If the density 
of states (DOS) of electrons in the source is a 
decreasing function of energy, the density of 
electrons will decrease superexponentially with 
increasing energy (Fig. 1B and blue curve in Fig. 
1C), resulting in a much more localized electron 
distribution around the Fermi level Fp. This DS 
still permits a large thermionic current in the on- 
state, as in a conventional FET (see Fig. 1D). 

We chose graphene as the DS material. Graphene 
has a linear band dispersion, and when suitably 
doped it may yield a superexponentially decreas- 
ing electron density with increasing energy toward 
the Dirac point (Fig. 1B). In addition to providing 
a sharply localized carrier distribution around 
Eg, the DS should also have a clean interface with 
the channel and suitably aligned E,; and band 
edges. For a graphene source, semiconducting 
single-walled carbon nanotubes (SWCNTs) can 


form a highly transparent interface for the in- 
jection of carriers when the DS-FET is in its on- 
state, but they create a large barrier to block 
the leakage current when in its off-state. Given 
their extremely high mobility for both electrons 
and holes and their ultrathin body, CNTs have 
been used for building conventional FETs that 
show better performance than state-of-the-art 
Si MOSFETs when scaled down to the sub-10 nm 
technology node (18-24). When combined with 
graphene, a CNT FET could be scaled down to 
5 nm without suffering noticeable short-channel 
effect, and a near-thermionic limit SS value of 
60 mV/decade was demonstrated (24). 

Our DS-FET device (Fig. 2A) consisted of three 
components: (i) a semiconducting SWCNT chan- 
nel, (ii) a suitably doped “cold” graphene DS (GDS), 
and (iii) a highly efficient Y.O,/Pd top gate stack 
to provide electrostatics control of the chan- 
nel (see fig. S1). The device must set Ey in the 
n-branch of the GDS for p-type FETs or in the 
p-branch for n-type FETs, and the Schottky bar- 
rier between the graphene and CNT must be 
small or even zero when the device is switched 
to on-state (blue region in fig. SIB), because a 
large Schottky barrier will dominate the carrier 
injection process (25). 

We used a control gate (CG) to tune the Fermi 
level (Erg) of the GDS (Fig. 2, A and B). In Fig. 
2A, the DS FET was designed to be a p-type FET 
because the CNT channel contacts a Pd (p-type) 
drain. In Fig. 2C, we show a transport model 
based on a calculated band-edge profile using 
full quantum transport simulations (detailed in 
fig. S2). Beneath the control gate, the graphene 
is n-doped. When biased with a drain voltage, 
electrons are injected from the n-doped graphene 
source to the drain, passing through an n-p 
junction in graphene via tunneling before reach- 
ing a CNT bulk barrier, the height and width of 
which can be tuned by a gate on top of the CNT. 
In graphene, the n-p tunneling is Klein tunnel- 
ing and is essentially transparent (26, 27). Thus, 
the device current was mainly modulated by the 
top gate via varying the height and width of the 
CNT bulk barrier. 

In the off-state, the CNT bulk barrier width 
was >100 nm and the tunneling current through 
this barrier was negligible; the dominating con- 
tribution to the off-state current originated from 
the thermal emission current over the CNT bulk 
barrier (Fig. 2C). During the switching-off proc- 
ess, the barrier height for hole injection increased 
and simultaneously the DOS of the n-doped 
graphene source over the top of the barrier 
decreased toward the Dirac point (see Fig. 2C 
and fig. $3). Thus, the thermally activated hole 
density over the barrier in the graphene source 
decreased with decreasing energy superexponen- 
tially, leading to a slope steeper than 60 mV/ 
decade in the DS-FET (Fig. 2C). To verify the dom- 
inating operation mechanism for sub-60 mV/ 
decade SS in DS-FETs, we simulated the device 
operation theoretically using both analytical and 
numerical methods (see supplementary materials 
and fig. $1). A ballistic FET model with an n-doped 
graphene and Pd-contacted CNT was used to fit 
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Fig. 1. Electron density distribution and 
characteristics of normal and Dirac sources. 
(A and B) Schematic band structure, density 
of states, electron density, and thermionic 
electron emission over a potential barrier in 
the conduction channel for a normal source 
with a subexponential decaying of electron 
density over the potential barrier (A) and 

a Dirac source with a superexponential 
decaying and thus a more narrowly 

localized electron density around the 

Fermi level (B). The solid line in n(E) represents 
a Boltzmann distribution with exponential 
decaying tail toward higher energy; 

n(E) represents density of electrons. 

(C) Comparison of electron density 
distributions between normal sources, 

based on 3D semiconductor (green curve) 
and on 2D material (black curve), and a 
Dirac source based on graphene; the blue 
curve shows increasingly more localized 
electron distribution around the Fermi 
surface for graphene. (D) Schematic transfer 
characteristics of a conventional FET with 

a normal source and thus 60 mV/decade-limited 
subthreshold swing (SS), and a DS-FET 

with a Dirac source and much steeper SS. 
Insets are electron density distributions for 

a normal source (green) and Dirac source 
(blue), and thermionic electron emission 
over the potential barrier at low bias (off-state, lower 
inset) and high bias (on-state, upper inset). 


Fig. 2. Structure and performance of a DS-FET with a 
control gate. (A) Schematic diagram showing a DS-FET with 
a control gate in addition to the normal gate. (B) Top-view 
SEM image of a DS-FET. Scale bar, 200 nm. The devices 
are fabricated on doped Si substrate covered with a 
300-nm layer of SiOz. (©) Schematic diagrams illustrating 
the off-state of the DS-FET. (D) Transfer characteristics 

of a typical DS-FET at different values of Vcg. Circles, 
experimental results; curves, simulated results. Green 

and blue denote results obtained at Veg = 1.5 V and O V, 
respectively. Insets are schematic band edge profiles for 
fitted data situations. (E) Subthreshold swing phase diagram 
as a function of the control gate voltage, showing the 
condition for the appearance of sub—60 mV/decade 
behavior. The orange dashed line is a guiding curve of 
SS-Vecg for DS-FETs according to simulation results. Each 
colored star represents one DS-FET at the corresponding 
Voq. (F) Temperature-dependent SS of a typical DS-FET 
measured at temperatures between 77 and 300 K. 

Vcg was set at 2 V to keep the device in Dirac-source 
mode. SS varied by more than 100% from 77 to 300 K. 

In all measurements, the substrate was biased with 

-20 V to keep the ungated region near the drain open. 
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Fig. 3. Structure and performance of DS-FETs with chemically 
doped Dirac sources. (A) Schematic diagrams illustrating the 
structure of a DS-FET, with a chemically n-doped graphene source 
and Pd-contacted drain. (B) Top-view SEM image showing a DS-FET. 
Scale bar, 500 nm. (C) Transfer characteristics of a typical DS-FET 
with Vg, = -0.1 V and -0.5 V. The substrate was biased with —20 V. 


the measured transfer characteristics (Fig. 2D), 
showing excellent agreement between experi- 
mental and theoretical results. 

The two transfer characteristics shown in 
Fig. 2D correspond to two operation modes of 
the CNT FET with graphene as the source (for 
more transfer characteristics, see fig. S4). When 
the control gate was set at zero (Veg = 0 V), the 
graphene source was usually p-doped; the car- 
rier DOS increased with increasing Vcg toward 
the off-state (Fig. 2D, right inset). The device 
worked as a conventional FET with a SS of 
67 mV/decade. However, applying a large posi- 
tive Vcq to the control gate tuned the GDS to its 
n-branch (Fig. 2D, left inset). The carrier DOS in 
the GDS decreased with increasing Veg toward 
off-state. The transistor now had a SS of 46 mV/ 
decade at room temperature (Fig. 2D, green curve). 

Extensive measurements were then carried 
out on the dependence of SS on Veg for multiple 
DS-FETs. Figure 2E defines the region in which 
the CNT FET works as a DS-FET, yielding <60 mV/ 
decade, and the main trend of experimental re- 
sults is captured very well by our model simulations 
(fig. SIB). When in DS-FET mode, the transfer 
characteristics of a typical FET were measured 
at various temperatures 7 between 77 and 300 K 
(fig. S5A), and the extracted SS (Fig. 2F) showed 
a linear dependence on 7, as predicted by the- 
oretical simulations (see fig. S5B). The linear 
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and at bias of -0O.1 V. 


SS-T relation further confirmed that the main 
mechanism for electron injection over the poten- 
tial barrier at subthreshold region is thermal 
emission rather than band-to-band tunneling 
(BTBT), because for BTBT SS shows much weaker 
T dependence (2, 28). Tunneling assisted by 
phonons and mid-gap trap states may occur in 
T-FETs, resulting in 7-dependent SS (29). In 
these T-FETs, SS varies weakly with T (within a 
few percent from 100 to 300 K) and/or becomes 
much greater than 60 mV/decade at room tem- 
perature. Trap states may act as stepping stones 
in these devices that lead to band-to-band tun- 
neling in off-state and degrade SS (29, 30). 

In real applications, FETs with additional con- 
trol gates are undesirable because this structure 
introduces additional supply burden and is also 
not scalable. An alternative compact DS-FET struc- 
ture (Fig. 3A) has a chemically doped GDS. The Pd 
film was patterned as the drain contact to define 
the polarity of the FET as p-type (18). The graphene 
was intentionally doped during the chemical 
vapor deposition growth process (31), and the 
Dirac point voltage of the as-grown graphene 
was distributed around -50 V (see fig. S6A) for 
the group of samples used in this work, thereby 
confirming that the graphene was effectively 
n-doped. Part of the graphene source and CNT 
channel are beneath the top gate, and an ultrathin 
YO; is used as the gate dielectric with an equiv- 
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(D) Statistical distribution of SS for 43 DS-FETs with chemically 
doped graphene as the Dirac source. (E) Transfer characteristics 

for some representative DS-FETs with Vy, = —O.1 V and average 

SS ranging from 40 to 50 mV/decade. (F) Transfer characteristics of 
three extremely scaled DS-FETs with channel length of about 15 nm 


alent oxide thickness of 1.5 nm (32). More than 
50 such DS-FETs were fabricated; the SEM image 
of a typical DS-FET is shown in Fig. 3B. Transfer 
characteristics of the DS-FET (Fig. 3C) demon- 
strate the sub-60 mV/decade subthreshold behay- 
ior of this device. At the subthreshold voltage, the 
transistor presents a SS of 35 mV/decade at room 
temperature at both low bias (Va, = -0.1 V) and 
high bias (Va, = -0.5 V). Moreover, the sub-60 mV 
SS behavior was measured for both sweeping 
directions of V,, and showed negligible hyster- 
esis (see fig. S7). The devices achieved an average 
SS < 60 mV/decade over four decades of current 
(see Fig. 3E and fig. $7) and can thus meet the 
requirement set by the International Technology 
Roadmap for Semiconductors (ITRS) (4). 

In addition, the DS-FET based on a single 
CNT can provide high on-state current of up to 
6.5 wA at Vps = -0.5 V, which is orders of mag- 
nitude higher than that observed in a tunnel 
FET based on a single CNT (28, 33). The DS-FET 
also exhibits much suppressed ambipolar char- 
acteristics relative to conventional top-gated CNT 
FETs (34), presenting a large current on/off ratio 
of >10° at Va, = -0.5 V. More than 40 DS-FETs 
have been successfully fabricated with sub- 
60 mV/decade SS, and the statistical distribution 
of SS shown in Fig. 3D demonstrates that most 
devices (32 devices) exhibited SS < 50 mV/decade 
at room temperature. The transfer characteristics 
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Fig. 4. Comparison of DS-FETs, state-of-the-art T-FETs, NC-FETs, and Intel 14-nm MOSFETs. 
(A) Schematic diagram illustrating the structure of a DS-FET with CNT array as the channel. 

(B) On-state current /g, versus on/off current ratio for DS-FETs, state-of-the-art T-FETs, NC-FETs, 
and Intel 14-nm MOSFETs. Red and blue squares denote two typical DS-FETs; open black circles 
represent Intel 14-nm HP (high performance), SP (standard performance), and ULP (ultralow-power) 
MOSFETs (35); open green circles, IlI-V NW T-FET (13); solid green circles, MoS2-Ge T-FET (36); 
solid orange circles, squares, pentagons, hexagon, diamond, and triangles, typical NC-FETs with 
Vop < 1 V (38-43). (C) Subthreshold swing versus /ys. l60 values in the yellow region are for 
T-FETs (16); those in the cyan region are for NC-FETs with hysteresis (H) smaller than 0.5 V; and 
those in the green region are for NC-FETs with H larger than 1 V (38-45). The red, green, blue, 
and purple curves with solid symbols at the upper right represent four different DS-FETs. The black 
curves with different open symbols at the left represent different T-FETs (16). (D) Comparison 
between a 400-nm DS-FET (125 CNTs/um) and a commercial Intel 14-nm Si MOSFET. The Si MOSFET 
(black) was powered by 0.7 V (35); the DS-FET (red) was powered by a low bias of 0.5 V. 


of nine representative devices are plotted in 
Fig. 3E. 

Because miniaturization is an important issue 
for large-scale integrated circuits (ICs), we fab- 
ricated DS-FETs with ultrashort channel lengths 
of <15 nm (see figs. S8 and S9 for the relevant 
structure and fabrication process of the extremely 
scaled DS-FETs); Fig. 3F shows measurement 
results for three such DS-FETs with SS < 60 mV/ 
decade. The short-channel DS-FETs showed very 
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high on-state current of >10 uA per CNT (fig. S10), 
which is similar to that achieved in the best re- 
ported high-performance p-type conventional 
CNT FETs (78). 

The DS-FET transistor can achieve higher per- 
formance and lower dissipation than the most 
advanced Si CMOS FETs at lower supply voltage. 
Pure semiconducting CNT arrays with high den- 
sity (e.g., 125 tubes/um) have been considered as 
a good choice for channel material to construct 


scalable FETs with performance exceeding that 
of the best Si CMOS FETs (20). We projected the 
performance of DS-FETs based on a CNT array 
(Fig. 4A) by normalizing the experimental data 
on single CNTs and assuming a CNT array den- 
sity of 125 tubes/um. The key challenge for real- 
izing high-density CNT array-based DS-FETs 
results mainly from the nonuniformity of CNTs 
in the array and their contacts to graphene. 
Identical CNTs with the same chirality and highly 
uniform contacts to graphene are needed to re- 
tain the threshold voltage of every CNT in the 
channel within a required narrow range. Ideally, 
a high-performance FET should have high on- 
state current J,,, and large current on/off ratio 
at small supply voltage. Figure 4B compares the 
on-state current Ja, and on/off ratio of our DS- 
FETs and that of state-of-the-art Si MOSFETs, 
the best T-FETs, and NC-FETs (with Vpp < 1 V) 
(35-46). For the same on/off ratio, two repre- 
sentative DS-FETs (blue and red lines) at Vpp = 
0.5 V show Ja, values similar to that of the 
best Si p-type MOSFET (Intel 14-nm technol- 
ogy node) (35) driven by Vpp = 0.7 V, which is 
10 to 100 times that of the most advanced II-V 
nanowire-based heterojunction T-FETs (73, 36) 
and much greater than that of the best reported 
NC-FETs (38-43). 

An important figure of merit—Jgo, the current 
when the subthreshold swing SS equals 60 mV/ 
decade—accounts for both switching and device 
on-state characteristics (37). Higher values of Ig 
correspond to higher on-state current and lower 
average SS. To benchmark the performance of 
sub-60 mV/decade devices by using Jo, we plot 
SS-Jg, relations of the best published T-FETs, 
NC-FETs, and our CNT-based DS-FETs in Fig. 4C. 
For a sub-60 mV/decade device to have a per- 
formance comparable to that of the conven- 
tional FET, the desired target value of Igo for 
sub-60 mV/decade devices is in the range 1 to 
10 wA/ttm at Vpp below 0.5 V. However, all the 
reported T-FETs have J¢o values that are typically 
two orders of magnitude lower than the target 
(16); in particular, the most advanced InAs/Si 
nanowire (46) T-FET has an J¢9 of 0.02 uA/um 
at Va, = 1 V. As a comparison, CNT-based DS- 
FETs typically have Ig, values in the range 1 to 
40 wA/um at Vag = -0.5 V, which meets the Io 
challenge for sub-60 mV/decade devices. 

Additional challenges for T-FETs include op- 
timization of the device architecture for high J,, 
combined with an average SS lower than 60 mV/ 
decade over at least four decades. The represent- 
ative transfer characteristics shown in Fig. 3E 
amply demonstrate that the sub-60 mV/decade 
region of CNT-based DS-FETs covers more than 
four decades of current, meeting the target of 
ITRS for sub-60 mV/decade transistors. Although 
some NC-FETs show high J¢o values in the range 
1 to 40 uA/um (Fig. 4C), large hysteresis (larger 
than 1 V) remains the main challenge for their 
applications in ICs. For these reported NC-FETs 
with small hysteresis (<0.5 V), Igo is usually lower 
than 1 uA/um (38-45). As a result, high perform- 
ance and small hysteresis have not been achieved 
by the same NC-FET. 
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Figure 4D shows a direct comparison be- 
tween the CNT array-based DS-FET (with a gate 
length of 400 nm) and state-of-the-art Si MOSFET 
(Intel 14-nm technology) (35). The 400-nm CNT 
DS-FET exhibited an J,, value similar to that of 
the 14-nm Si MOSFET, but with much steeper SS 
(with an average value of about 50 mV/decade 
over four decades of current) and much lower 
supply voltage (0.5 V versus 0.7 V). The power 
dissipation of modern ICs depends strongly on 
supply voltage (which is proportional to Vpp”). 
The lowering in Vpp from 0.7 V to <0.5 V with 
DS-FETs will reduce power dissipation better 
than state-of-the-art Si-based computer chips 
while maintaining similar high performance. 

Although we only demonstrated p-type DS 
FETs, according to the symmetric energy band 
(19) we could realize an n-type DS-FET based on 
CNT by using p-doped graphene as the source 
and Sc as the drain. In principle, it is possible 
to realize complementary DS-FETs following 
procedures similar to those outlined here. 
CNT provides an excellent conduction chan- 
nel for DS-FET, but the key to the realization 
of sub-60 mV/decade SS in DS-FETs is not the 
CNT channel. In principle, GDS may also be 
combined with other semiconductor channel 
materials [e.g., semiconducting nanowires, 
two-dimensional (2D) semiconductors, or even 
conventional bulk semiconductors] to simulta- 
neously achieve sub-60 mV/decade SS and high 
Igy. The DS-FET may thus be used as a general 
device or building block for future ICs with sub- 
0.5 V power supply. 
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The hippocampal engram maps 
experience but not place 


Kazumasa Z. Tanaka’, Hongshen He””, Anupratap Tomar'}, Kazue Niisato’, 


Arthur J. Y. Huang’, Thomas J. McHugh?* 


Episodic memories are encoded by a sparse population of hippocampal neurons. In mice, 
optogenetic manipulation of this memory engram established that these neurons are 
indispensable and inducing for memory recall. However, little is known about their in vivo 
activity or precise role in memory. We found that during memory encoding, only a fraction 
of CA1 place cells function as engram neurons, distinguished by firing repetitive bursts 
paced at the theta frequency. During memory recall, these neurons remained highly 
context specific, yet demonstrated preferential remapping of their place fields. These data 
demonstrate a dissociation of precise spatial coding and contextual indexing by distinct 
hippocampal ensembles and suggest that the hippocampal engram serves as an index 


of memory content. 


umerous theories have attempted to link 

the physiology of the hippocampus with 

its role in episodic memory (J, 2); however, 

experimental tests of these ideas remain 

scarce. One prevailing model suggests that 
the hippocampal memory trace contains rich 
information about the animal’s current location 
within the cognitive domain, providing a spatial 
framework on which events and items can be 
anchored and related [cognitive map theory 
(3, 4)]. This model is supported by data demon- 
strating that synaptic plasticity stabilizes these 
hippocampal maps and that animals can reliably 
recall these representations, even at remote time 
points (5, 6). An alternative, although not mu- 
tually exclusive, hypothesis, the memory index 
theory, asserts that the hippocampal memory 
trace is primarily an index that provides rapid 
and efficient access to the content of an episodic 
memory stored in the neocortex (7, 8). This the- 
ory is agnostic to the information content in 
the hippocampus, emphasizing its role in re- 
activating downstream cortical modules, with 
plasticity serving to establish a link between the 
hippocampal index and the neocortical activity 
pattern. 

Although place cell physiology supports the 
cognitive map theory (9), behavioral studies 
using contextual fear conditioning [CFC (J0)] 
may be better interpreted via the memory index 
theory. In CFC, the contextual representation 
is index-like because its formation requires 
conjunctive exposure to cues and its rapid re- 
call does not require physical exploration of the 
environment (17). Moreover, the expression of 
activity-induced genes, such as c-Fos, is rapidly 
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and robustly induced in a specific neuronal en- 
semble after presentation of novel configura- 
tions of stimuli and is thought to define the 
neuronal substrates of the contextual repre- 
sentation (72, 13). Recent studies (14) further 
strengthen this view, demonstrating that inhi- 
bition of hippocampal engrams can block mem- 
ory recall in contextual tasks, whereas activation 
can drive context-triggered behavior (15, 16). The 
link between this contextual memory represen- 
tation and precise locations or routes represented 
by place cells during exploration remains unknown. 

We therefore conducted tetrode recordings 
from CAI pyramidal cells in freely moving c-Fos- 
tTA (tetracycline-transactivator) transgenic mice 
infused with virus expressing channelrhodopsin2 
and enhanced yellow fluorescent protein under 
the control of the Tet-responsive element (AAV- 
TRE-ChR2-EFYP). Doxycycline was removed from 
the diet of the mice, triggering labeling of c-Fos- 
expressing (positive) cells with ChR2, and place 
cell activity was recorded as mice explored a novel 
context (A; encoding context; Fig. 1A). The next 
day (12 to 14 hours later; fig. S1), the animals were 
reexposed to context A (recall) to examine the 
stability of the spatial map. CA1 was stimulated 
with pulses of blue light (10 mW, 0.5 Hz, 15 ms) 
to identify the subset of cells that expressed 
ChR2 as a result of c-Fos expression during the 
first session (Fig. 1B and fig. $2). In a separate 
cohort of mice, we verified that this OptID pro- 
tocol did not affect excitability or spatial coding 
properties of the labeled neurons (fig. $3). Across 
the seven animals used in this study, 19.59 + 
2.65% of putative pyramidal cells exhibited light- 
induced spikes and thus were identified as c-Fos 
positive (Fig. 1C). This fraction is consistent with 
previous reports (17). Finally, the same mice 
explored a distinct environment (B) to examine 
context-specific activity (18). 

We first examined the spatial firing of c-Fos- 
positive neurons as the animals explored the nov- 
el context A. The majority (79.31%) of these cells 
had place fields in the labeled context (Fig. 1D; 


see methods for place field criteria). However, 
a large fraction (75.53%) of the active place cells 
were not labeled with ChR2 (Fig. 1E). Although 
physiologically similar, with indistinguishable 
peak firing rates (Fig. 1F; pos: 6.99 + 1.06 Hz; 
neg: 6.06 + 0.53 Hz, W = 640, p = 0.20), c-Fos- 
positive place cells exhibited significantly higher 
mean firing rates (Fig. 1G; pos: 1.43 + 0.22 Hz; 
neg: 0.77 + 0.11 Hz, W = 357, p = 5.4 x 10°). 
Accordingly, positive cells had larger place fields 
(Fig. 1H; pos: 28.00 + 3.50 cm?; neg: 17.97 + 
1.79 cm?, W = 517, p = 0.0085) and on average, 
their spikes carried lower spatial information 
(Fig. 11; pos: 0.69 + 0.81 bits/spike; neg: 1.21 + 
0.83 bits/spike, W = 1191, p = 0.001). 

We next analyzed the temporal structures of 
spike activity during exploration (A, encoding). 
Interspike interval (ISI) analysis revealed that 
spikes from engram cells were more likely to 
occur in bursts (3- to 15-ms ISI) (Fig. 2A). These 
neurons had significantly higher burst rates 
(Fig. 2B; pos: 16.80 + 3.04 bursts/min; neg: 8.87 + 
1.44 bursts/min, W = 1243, p = 0.00018) and 
shorter average interburst intervals (IBIs) (Fig. 2C; 
IBI: pos: 6.55 + 1.30 s; neg: 16.27 + 2.15 s, W = 415, 
p = 0.00042). Burst events in c-Fos-positive neu- 
rons were preferentially spaced at ~125 ms, an 
interval corresponding to the 8-Hz theta rhythm 
in the hippocampus; thus, bursts from these neu- 
rons were significantly more theta modulated 
than bursts from c-Fos-negative cells (Fig. 2, D to 
F; theta modulation index, W = 418, p = 0.0009; 
theta power in burst spectrum, W = 1151, p = 
4.8 x 10°). We defined theta-burst events (TBEs) 
as bursts repeated at IBIs of 83 to 167 ms (6 to 
12 Hz) and compared this property across the 
groups. During exploration of the novel context 
A, c-Fos-positive cells showed higher rates of 
TBEs than c-Fos-negative place cells (Fig. 2, G 
and H; pos: 5.44 + 1.41 bursts/min; neg: 2.58 + 
0.53 bursts/min; W = 417.5, p = 0.00046) and 
longer repetitions of bursts (Fig. 2, I and J; pos: 
2.68 + 0.095 bursts/event; neg: 2.56 + 0.052 bursts/ 
event; W = 652.5, p = 0.20; Fig. 21, max TBE 
length: pos: 7.13 + 0.78 bursts; neg: 5.28 + 
0.34 bursts, W = 542, p = 0.022). These burst 
features were no longer prominent in ChR2- 
labeled neurons when the animals explored 
context B (fig. S5), suggesting their involvement 
in contextual encoding. 

We then examined the temporal modulation 
of place cell spiking by the oscillatory population 
activity in the local field potential (LFP) (A, en- 
coding). CAl neurons can be entrained by the 
theta (6 to 12 Hz) rhythm, as well as by slow (30 
to 50 Hz) and fast (55 to 85 Hz) gamma os- 
cillations, which correlate with CA3 and ento- 
rhinal cortical inputs to CA1, respectively (19). 
Spikes from c-Fos-positive and -negative cells 
were similarly theta entrained and preferred 
the ascending phase of the oscillation (Fig. 3, A 
and B; mean preferred phase, pos: 229.9 + 1.73°; 
neg: 199.2 + 1.85° W = 0.42, p = 0.81). However, 
TBE spikes in positive neurons preferentially 
occurred during the descending phase, whereas 
those in negative neurons remained locked to 
the ascending phase (Fig. 3, C to E; mean 
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preferred phase: pos: 170.7 + 1.86°; neg: 325.7 + 
1.54°, W = 62.66, p = 2.5 x 10“). Next, to address 
any differences in coupling of positive and neg- 
ative neurons to CA1 inputs, we examined spikes 
during gamma events. Significantly more spikes 
from positive neurons occurred during fast gam- 
ma events, and a larger fraction of these neurons 
were phase locked to fast gamma oscillations com- 
pared to the c-Fos-negative population (Fig. 3F; 
percentage of spikes: pos: 2.62 + 0.17%; neg: 
2.19 + 0.11%, W = 1067, p = 0.0072; percentage of 
phase-locked cells: pos: 26.09%; neg: 14.08%, chi- 
squared test, p = 0.0014). There was no signif- 
icant difference in spiking or phase locking 
between the groups during slow gamma (Fig. 3G; 
percentage of spikes: pos: 5.81 + 0.31%; neg: 
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5.53 + 0.23%; W = 888, p = 0.53; percentage of 
phase-locked cells: pos: 26.09%; neg: 29.58%, 
chi-squared test, p = 0.62). These data suggest 
that although both populations of neurons are 
similarly tuned to CA3-driven excitation, c-Fos- 
positive neurons may be more responsive to in- 
put from the entorhinal cortex, resulting in the 
temporal shift in their distinctive theta-paced 
bursting. 

Finally, we examined hippocampal activity 
during memory recall and assessed the stability 
of the spatial representation of engram cells when 
animals revisited the labeling context (A). Con- 
trary to expectations of a spatial memory trace, 
many of these cells shifted their firing locations 
during the second visit (Fig. 4A and fig. S6). In 
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positive neurons, the average correlation of the 
firing-rate maps from the encoding and recall 
sessions was close to zero, significantly lower 
than that from negative place cells (Fig. 4B; pos: 
0.079 + 0.070; neg: 0.31 + 0.052, W = 560, p = 
0.010). However, correlations of the mean firing 
rate, independent of position, were similar across 
both populations of cells (Fig. 4C; pos: 0.36 + 
0.053; neg: 0.36 + 0.029, W = 412, p = 0.81), 
suggesting that c-Fos-positive neurons repre- 
sent contextual information in a different man- 
ner. Therefore, we examined their activity in a 
distinct context. When animals explored context 
B, many c-Fos-positive cells remained silent 
(Fig. 4A and fig. S6; 43.5% of positive place cells 
had a peak rate of <1 Hz in context B; peak 


Fig. 1. c-Fos induction in a fraction of 
place cells during exploration of a novel 
context. (A) A schematic of the experimen- 
tal protocol. Mice were exposed to a novel 
context (A, encoding) when doxycycline 

was removed from the diet (OFF DOX), 
followed by reexposure to the same context 
(A, recall). Twenty-four hours after A (encod- 
ing), labeled cells were identified through 
light-induced spikes. Finally, mice explored a 
distinct context B (novel). (B) A representa- 
tive image of ChR2-EYFP (green) expression 
in the dorsal CA1 of the hippocampus. 

The white arrowhead indicates a tetrode 
location. DAPI, 4’,6-diamidino-2-phenylindole. 
(C) Peristimulus time histogram of representa- 
tive single units classified as c-Fos—labeled 
(pos) or not labeled (neg). Blue area repre- 
sent light on epoch (15 ms). (D) Firing-rate 
maps of representative place cells showing 
location-specific firing during exploration in 
context A (encoding). (E) Pie plots describing 
percentages of each cell type in recorded 
animals (N = 7), and their average (dashed 
square). (F to 1) Cumulative density plots 
with inset box plots comparing peak firing 
rates (F), mean firing rates (G), place field 
size (H), or spatial information (1) 

between c-Fos-—positive (orange) 

and —negative (black) place cells. Box 
plots show median, first, and third 
quantiles, and minimum and maximum 
values within 1.5x the interquantile range 
(IQR) from each quantile. 
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firing rate in context A: 6.99 + 1.06 Hz; context 
B: 3.42 + 0.76 Hz, W = 392, p = 0.005). Con- 
sequently, there were fewer c-Fos-positive place 
cells, and the spatial information of these neurons 
was smaller than in context A (Fig. 4D, Friedman 
rank sum test, Friedman chi-squared = 6, p = 
0.014; two-way analysis of variance, significant 
interaction, Fqg9) = 4.9, p = 0.029; Fig. 4E, 


Fig. 2. Theta-paced burst 
activity in c-Fos—positive place 
cells. (A) Histograms showing 
densities of interspike intervals 
from all c-Fos—positive (orange) 
or —negative (black) place 
cells. (B and ©) Cumulative 
density plots and box plots 
comparing burst rates (B) and 
mean interburst intervals (C) of 
the two cell types. (D) Autocor- 


spatial information: context A: 0.89 + 0.21 bit/s; 
context B: 0.32 + 0.071 bit/s, W = 396, p = 0.0033). 
By contrast, the average spatial information car- 
ried by c-Fos-negative place cells was unchanged 
(Fig. 4F; context A: 0.51 + 0.064 bit/s; context B: 
0.37 + 0.044 bit/s, W = 2643, p = 0.30), although 
these cells typically shifted the location of their 


spatial firing (“remapped”) in the new context 
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(Fig. 4G; spatial correlation, A/A: 0.31 + 0.052; 
A/B: 0.013 + 0.052, W = 982, p = 0.00029). These 
data are consistent with the hypothesis that en- 
gram cells do not necessarily represent reliable 
spatial information about the external world, but 
rather through their net activity serve as an index 
to episodic information stored elsewhere in the 
brain (7, 8). We therefore sought to discriminate 
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Fig. 3. LFP modulation of spikes from c-Fos—positive and —negative 
place cells. (A) Spike probability of the two types of cells over phases of 
theta oscillation (6 to 12 Hz) [positive (orange) or negative (black); 

only significantly phase-locked cells were used]. Envelopes represent 
confidence intervals; the dashed line represents theta. (B) Mean resultant 
length of c-Fos—positive (orange) or —negative (black) cells plotted as a 
function of preferred theta phase. Dot size represents mean firing rate of 
that cell. Only significantly phase-locked cells are plotted (positive: 

22 cells; negative: 68 cells). (©) Example spike raster plots with LFP traces 
[raw, theta filtered (6 to 12 Hz), gamma filtered (30 to 85 Hz)]. Red arcs 
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represent a TBE occurring at the descending phase of theta. (D) A rose 
plot showing TBE spike probability across theta phases [positive 
(orange) or negative (black), 10° bins]. (E) A rose plot showing spike 
probability of c-Fos—positive place cells over theta phases (TBE spikes are 
in blue, all spikes in red). (F) Percentage of spikes occurring during fast 
gamma events (top), and percentage of cells phase locked to fast 
gamma oscillations [bottom; positive (orange) or negative (black)]. 
(G) Percentage of spikes occurring during slow gamma events (top), 
and percentage of cells phase locked to slow gamma oscillations [bottom; 
positive (orange) or negative (black)]. 
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Fig. 4. Different responses of c-Fos—positive and —negative place cells 
to the encoding context and a distinct novel context. (A) Firing-rate maps 
of c-Fos—positive (top) or —negative (bottom) place cell activity during 
encoding (left), reexposure to the encoding context (middle), or exploration 
of a distinct novel context (right). (B) Spatial correlation between place 
maps in A (encoding) and A (recall) [positive (orange) or negative (black)]. 
(C) Mean firing-rate correlation between place maps in A (encoding) and 

A (recall) [positive (orange) or negative (black)]. (D) An animal-by-animal box 
plot showing observed overlap between c-Fos-positive cells and place cells 
normalized by those expected by chance, comparing context A (encoding, 


A (encoding; green) or B (novel; blue). (G) Spatial correlations of 
c-Fos—negative place cells in A (encoding; green) and B (novel; blue). 

(H) Distribution of ensemble firing-rate correlations of c-Fos—positive 

or —negative place cells between context A (encoding) — A (recall) (green) 
versus A (encoding) — B (novel) (blue). (1) Distribution of discrimination 
indexes obtained by c-Fos—positive (orange) or —negative (black) ensemble 
firing-rate correlations. (J) Instantaneous ensemble firing-rate correlations of 
the initial 6 min of the context A (recall) session binned by 1 min (left), or that 
of the first 10 s (right). c-Fos—positive place cells are in orange, c-Fos— 
negative cells in black. For (H) to (J), values are plotted as probability 


green) versus B (novel, blue) sessions. Spatial information of (E) c-Fos— 
positive or (F) c-Fos—negative place cells when animals explore context 


contexts only from firing-rate correlations of a 
subpopulation of neurons. We randomly sub- 
sampled c-Fos-positive neurons and an equivalent 
number of c-Fos-negative place cells and cal- 
culated the correlation of the mean firing rates 
of each population between context A (encoding) 
and A (recall) or between context A (encoding) 
and B (novel), repeating the procedure 1000 times 
(see methods). As a population, the firing rate 
of c-Fos-positive place cells reliably discriminated 
the two contexts, whereas the firing rate of the 
ensemble of negative cells did not (Fig. 4H; prob- 
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ability of finding a higher correlation in A/B 
than A/A, p = 0.002 for positive ensemble, p = 
0.57 for negative ensemble). This was also evident 
in the discrimination index (Fig. 41; probability of 
finding a greater index in the negative than the 
positive ensemble, p = 0.049). In contrast to the 
negative ensemble, context discrimination from 
firing-rate correlations of c-Fos-positive cells was 
prominent early in the recall session, demonstrat- 
ing high correlation to the average rate of the 
encoding session during the first few minutes, 
or even the first 10 s (Fig. 4J and fig. S7; prob- 


densities (scaled). Box plots show median, first, and third quantiles, and 
minimum and maximum values within 1.5x the IQR from each quantile. 


ability of finding a higher correlation in the 
negative than the positive ensemble, p = 0.055, 
0.038, 0.011, 0.230, 0.906, and 0.754 for 1- to 6-min 
bins, p = 0.046 for the first 10-s bin), suggesting 
the rapid reactivation of contextual representa- 
tion by the subset of engram neurons. These data 
demonstrate that engram cells are place cells 
with lower spatial stability and accuracy, but 
with activity that can reliably and quickly reflect 
contextual identity. 

When animals explore a novel context, an en- 
gram is formed by a fraction of the place cells, 
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suggesting that the plasticity involved in engram 
formation is distinct from that for formation of 
place fields. During encoding in the labeled con- 
text, engram neurons demonstrate repetitive 
theta-frequency bursts. In vitro, this pattern of 
action potentials is known to produce a form 
of long-term potentiation mediated by brain- 
derived neurotrophic factor (BDNF) secretion 
and dependent on c-Fos expression (20, 27). c-Fos 
expression and theta bursts have been observed 
in CA1 when animals perform a non-spatial 
memory task (22, 23), suggesting that this ac- 
tivity may be a signature of hippocampal engrams 
indexing broader types of memories. 

Our study does not support the idea that the 
hippocampal engram simply encodes spatial 
memory because these neurons do not maintain 
their firing locations in the same environment 
across time. Instead, we propose that these cells 
may serve as an index for contextual memory 
through a shift of firing rates. Compared to other 
place cells, during recall these c-Fos-positive 
neurons rapidly spike at rates that are highly 
correlated to their activity during the encoding 
period and are preferentially silent in a distinct 
context. This feature may explain why optogenetic 
stimulation of the hippocampal engram can 
influence contextual memory recall (fig. S8) (24), 
despite lacking temporally meaningful pattern 
of spikes. Inhibition of the CAl engram reduces 
downstream cortical reactivation (25), consistent 
with a role in indexing. Our data suggest that spe- 
cific ensembles of hippocampal neurons simul- 
taneously support distinct domains of episodic 
memories—spatially reliable c-Fos-negative neu- 
rons for configurational coding and spatially un- 
stable engrams for contextual indexing. 
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EUSOCIALITY 


Social regulation of insulin signaling 
and the evolution of eusociality 


in ants 


Vikram Chandra’*+, Ingrid Fetter-Pruneda’*{, Peter R. Oxley”, Amelia L. Ritger’, 
Sean K. McKenzie’, Romain Libbrecht’*, Daniel J. C. Kronauer™* 


Queens and workers of eusocial Hymenoptera are considered homologous to the 
reproductive and brood care phases of an ancestral subsocial life cycle. However, the 
molecular mechanisms underlying the evolution of reproductive division of labor remain 
obscure. Using a brain transcriptomics screen, we identified a single gene, insulin-like 
peptide 2 (ilp2), which is always up-regulated in ant reproductives, likely because they are 
better nourished than their nonreproductive nestmates. In clonal raider ants (Ooceraea 
biroi), larval signals inhibit adult reproduction by suppressing ilp2, thus producing a colony 
reproductive cycle reminiscent of ancestral subsociality. However, increasing ILP2 peptide 
levels overrides larval suppression, thereby breaking the colony cycle and inducing a stable 
division of labor. These findings suggest a simple model for the origin of ant eusociality via 
nutritionally determined reproductive asymmetries potentially amplified by larval signals. 


usocial insects exhibit a reproductive divi- 

sion of labor in which queens lay eggs and 

workers perform other tasks (1). Eusociality 

in ants, and in many other Hymenoptera, 

likely evolved from a subsocial state in which 
a female wasp would lay an egg and then care for 
the resulting larva until pupation (1-3). Such 
brood care may have been induced by larval sig- 
nals, and observations of extant subsocial wasps 
are consistent with this scenario (2-4). This tem- 
poral reproductive and behavioral plasticity 
was then modified into a fixed reproductive 
asymmetry between queens and workers in 
eusocial colonies (2, 5). This raises three impor- 
tant mechanistic questions: (i) How are subsocial 
reproductive cycles regulated? (ii) How is the 
eusocial reproductive division of labor regulated— 
ie., what allows queens to lay eggs but prevents 
workers from doing so? (iii) What is the evolu- 
tionary trajectory that gave rise to fixed eusocial 
division of labor from subsocial cycles? Here we 
suggest that, in ants, evolutionary innovations in 
insulin signaling may have played a crucial role 
in each case. 

Eusociality evolved once in a common ances- 
tor of ants and, with the exception of a few derived 
social parasites, all extant ants are eusocial (6) 
(Fig. 1). To identify conserved potential regula- 
tors of division of labor between reproduction 
and brood care in ants, we conducted an unbiased 
screen for differentially expressed genes between 
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whole brains or heads of reproductives and non- 
reproductives across seven ant species, including 
four previously published datasets (Fig. 1 and 
tables S1 and S82) (7-11). We sampled a range 
of reproductive strategies, from species with 
morphologically distinct queens and workers to 
queenless species. Among all 5581 identified 
single-copy orthologs, we found only one such 
gene: insulin-like peptide 2 (ilp2). ilp2 was always 
significantly up-regulated in reproductives (Fig. 1). 
Thus, the differential expression of z/p2 is likely 
conserved across ants. Consequently, the most 
recent common ancestor of ants likely had ilp2 
expression that was high in reproductives and 
low in nonreproductives. 

Although our approach is conservative and 
probably misses genes, it has the advantage of 
eliminating false positives. When we relaxed the 
statistical stringency for classifying genes as dif- 
ferentially expressed, our screen still returned 
ilp2 as the single candidate gene (fig. S1). Relaxing 
other inclusion criteria revealed additional genes 
that might be expected to vary with reproductive 
state. For example, a total of 24 genes were con- 
sistently differentially expressed in subsets of five 
of the seven studied species (fig. S2 and table S3). 
This list includes insulin-like peptide 1 (ilp1), as 
well as other genes implicated in insulin signaling 
(fig. S3 and table S3). Non-single-copy orthologs 
were excluded from our screen. One example 
is vitellogenin (vg), a gene that has undergone 
repeated duplications in ants (72). The vitello- 
genin protein is a lipid carrier that provisions 
developing oocytes with yolk and constitutes a 
reliable indicator of female reproductive activ- 
ity (72, 13). Studies of bees and other insects have 
shown that vitellogenin interacts with insulin 
signaling (14-16). vg indeed showed consistently 
higher expression in reproductives in our screen, 
even though this difference was not statistically 
significant in two of the ponerines (fig. S3). These 
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findings further bolster the conclusion that insu- 
lin signaling played a major role in the evolution 
of reproductive division of labor in ants. 

Insulin regulates reproduction and food-seeking 
behavior across a wide range of organisms, making 
it a prime candidate for the regulation of sub- 
social cycles and eusocial division of labor (17). 
Most studied hymenopterans have two ILPs: ILP1 
and ILP2 (fig. S4). Whereas ILP1 resembles insulin- 
like growth factor, ILP2 is similar to canonical 
insulin (fig. S5) (17). In other holometabolous 
insects, these ILPs regulate larval growth, adult 
metabolism, and reproduction (17-19). More- 
over, caste determination in most ant species 
relies on nutritional asymmetries during devel- 
opment: Queen-destined larvae eat more than 
worker-destined larvae, which likely explains 
how queens acquire higher ILP2 levels (20). A 
study of Diacamma sp. found that the asymmetry 
in reproductive potential between ants was cor- 
related with insulin receptor expression in the 
ovaries (27). This suggests a possible secondary 
mode of reproductive control downstream of 
ILPs that may augment the initial reproductive 
asymmetry reflected by differential z/p2 expres- 
sion in the brain. 

ILPs have not been studied functionally in 
eusocial insects in the context of reproductive 
division of labor between adults. However, insu- 
lin signaling has been implicated in other contexts, 
such as caste development and nonreproductive 
division of labor (18, 22-24). Current data from 
wasps and bees do not typically indicate that i/p2 
is differentially expressed between adult queens 
and workers, suggesting that this expression pat- 
tern may be ant specific (table S5). This apparent 
inconsistency may be explained by the fact that 
eusociality has evolved independently in ants, 
bees, and wasps (1). Therefore, though insulin 
signaling may have been co-opted repeatedly 
during social evolution, the details likely differ 
between independent lineages. 

We used the clonal raider ant Ooceraea biroi 
to study ILP2 in ants. O. birot has secondarily 
lost queens, resulting in a species in which workers 
reproduce synchronously and asexually (13, 25). 
Colonies alternate between reproductive and 
brood care phases. This colony cycle is regulated 
by the periodical presence of larvae, which 
suppress reproduction and induce brood care 
behavior in adults, and is reminiscent of the 
subsocial cycle presumed to precede eusociality 
in ants. Despite this unusual biology, O. biroi is 
eusocial. Workers display cooperative brood 
care, colonies contain overlapping generations 
of adults, and reproductive asymmetry exists with- 
in colonies (25). 

We found that antibody staining of ILP2 ex- 
clusively localized to the brain, primarily in a 
single medial cluster of ~15 cells in the pars 
intercerebralis (Fig. 2, A to C, and fig. S6). These 
insulin-producing cells (IPCs) coincide in loca- 
tion with those of other insects (26, 27). Axons 
likely project to the corpora cardiaca, the only 
other brain region staining positive for ILP2 
(figs. S6 to S8). We quantified ILP2 in the IPCs 
and found that its levels are higher in the brood 
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care than in the reproductive phase (Fig. 2D and 
fig. S6). Peptide levels are thus anticorrelated 
with transcription. This pattern is known from 
Drosophila melanogaster, in which the rate of 
ILP secretion correlates with the rate of t/p tran- 
scription (27). This suggests that the mechanisms 
of ilp expression and ILP secretion are conserved 
in holometabolous insects. 

Because larvae regulate the O. biroi colony 
cycle, we asked whether larval communication 
altered ilp2 expression in adults. When larvae 
are removed from colonies in the brood care 
phase, z/p2 expression levels in adult brains in- 
crease markedly within 12 hours (Fig. 2E) (28). 
This increase occurs under identical nutritional 
conditions. Conversely, when ants in the repro- 
ductive phase are given larvae, their i/p2 levels 
decrease (Fig. 2E). ugg, the vitellogenin gene up- 
regulated in ant queens, responds similarly, albe- 


it more slowly, to these changes (fig. S9A), raising 
the possibility that ILP2 regulates reproduction 
at least partly by acting on vgq. Although this 
experiment is highly suggestive, the addition of 
larvae was always correlated with the removal of 
pupae, and changes in expression occurring after 
the 24-hour time point were confounded by nu- 
tritional differences. We therefore repeated this 
experiment without pupae and under nutrition- 
ally controlled conditions. We removed larvae 
from colonies in the brood care phase, waited 
until the ants in these colonies activated their 
ovaries, and then compared brain gene expres- 
sion between these and control colonies. Again, 
the removal of larvae increased i/p2 (Fig. 2F) and 
ogq (fig. S9B) expression. This finding suggests 
that social signals can mediate insulin signaling 
independently of internal nutritional state and 
that this is a key regulatory mechanism underly- 
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ing the O. biroi colony cycle. Given the conserved 
association of caste and i/p2 expression in all ants, 
social regulation of 7p2 may also underlie the life 
cycle of the subsocial ancestor. 

In D. melanogaster, insulin signaling is neces- 
sary and sufficient to regulate the terminal dif- 
ferentiation of germline stem cells into oocytes. 
Moreover, it promotes yolk uptake in developing 
oocytes and is crucial for ovary activation (29). It 
is therefore plausible that the differential expres- 
sion of z/p2 in ants has a causal role in regulating 
ovary activation and reproductive division of labor. 
We further hypothesized that if the regulation of 
ilp2 were freed, at least partially, from larval 
control, this would yield ants whose physiology 
is less susceptible to reproductive suppression. 
Such a mechanism would allow the evolution 
of distinct reproductive and nonreproductive 
castes from an ancestral subsocial cycle. To test 


i Fig. 1. Brain gene expression 
o. in seven ant species identifies 
see . one conserved differentially 

Me expressed gene. The figure 


: shows the summary cladogram 
of the seven ant species used in 
this study in the context of the 
entire ant phylogeny with all 
subfamilies labeled. Five of the 
focal species have queens, 
whereas two (Dinoponera 
quadriceps and O. biroi) are 
queenless. Although Harpegnathos 
saltator is not queenless, the 
data compared reproductive 
and nonreproductive workers 
(table S1). The dot plots show 
variance-stabilized transformed 
read counts for ilp2. Blue and 
orange dots indicate reproduc- 
tive (R) and nonreproductive 
(NR) ants, respectively. Hori- 
zontal bars indicate means, and 
asterisks indicate statistically 
significant differences between 
groups (Wald test: *P < 0.05; 
tt ***P < 0.001). All photos 
except for those of Acromyrmex 
echinatior are from A. Nobile, 
= S. Hartman, and E. Prado 
(www.antweb.org). Scale bars, 
2 mm. The phylogeny is 
based on (30). Species 
numbers are from (6). 
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Reproductive Brood care 


Fig. 2. Larvae regulate ilp2 in adults. (A to C) Immunohistochemistry 
with antibody against ILP2 (anti-ILP2) on an O. biroi brain localizes ILP2 
peptide to a single cluster of insulin-producing cells (IPCs) in the pars 
intercerebralis (body-axis dorsal view). Green, anti-ILP2; blue, DAPI (4’,6- 
diamidino-2-phenylindole); magenta, phalloidin. MB, mushroom body; 
AL, antennal lobe. (D) Total intensity of ILP2 in the IPCs is higher in the brood 
care phase than in the reproductive phase (n = 14 ants, t test; *P = 0.046). 
a.u., arbitrary units. (E) RNA sequencing (RNA-seq) time course shows 
that the addition of larvae down-regulates i/p2, whereas the removal 

of larvae up-regulates i/p2 [n = 4 biological replicates, time-transition 


this hypothesis, we injected synthetic O. biroi 
ILP2 mature peptide into workers in colonies 
with larvae. As a control, we injected the inactive 
B chain of this peptide (fig. SILA) (19). Injecting 
ILP2 mature peptide caused strong ovary activa- 
tion despite the presence of larvae (Fig. 3, A to C, 
and fig. S10A). Higher doses of ILP2 caused ants 
to develop more eggs simultaneously (fig. S10, B 
and C), suggesting that quantitative differences 
in ILP2 levels vary the ants’ positions along a 
spectrum of reproductive potential. To ensure that 
ILP2 does not have inhibitory effects during the 
opposite phase of the colony cycle, we injected 
ants in the reproductive phase with ILP2 and 
found no detectable effect on ovary state (fig. S11, 
B and C). 

Finally, we hypothesized that as developmen- 
tal nutritional asymmetries determine caste in 
most ants, this might be a general and natural 
mechanism that produces asymmetries in baseline 
adult ILP2 levels and consequently in reproductive 
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potential. Whereas most O. biroi workers have two 
ovarioles, some individuals (“intercastes”) have 
four or more (25) (fig. S12, A and B). We found 
that these differences can be determined by the 
amount of food a larva receives (fig. S13). Inter- 
castes have longer and more active ovaries com- 
pared with those of regular workers in the brood 
care phase, suggesting intercastes are less sensi- 
tive to larval signals that suppress ovarian ac- 
tivity (Fig. 4A and fig. S12C). This finding is 
consistent with previous work showing that 
some intercastes fail to regress their ovaries 
during the brood care phase (25). Additionally, 
we found that the IPCs of intercastes contained 
more ILP2 than those of regular workers (Fig. 4, 
B and C). As we have shown above, ILP2 peptide 
levels are negatively correlated with ilp2 expres- 
sion, ovary state, and, by extension, circulating 
ILP2 levels in workers between the different 
phases of the cycle, probably owing to higher rates 
of peptide release during the reproductive phase 
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Larvae 


No larvae 


interaction, likelihood ratio test with 5% false discovery rate (FDR) 
correction; P < 107°]. The arrow indicates when ants with larvae 

were fed (i.e., changes in expression beyond that time point are 
confounded by differences in nutrition). Error bars depict SEM. Data are 
from (28). (F) RNA-seq on ant brains shows that under nutritionally 
controlled conditions, i/p2 is up-regulated 8 days after larvae are 
removed from O. biroi workers in the brood care phase (n = 4 biological 
replicates, Wald test with 5% FDR correction; ***P < 10°). Data 

are variance-stabilized transformed read counts. Horizontal bars 


(Fig. 2D). The phase-matched comparisons be- 
tween different types of workers, on the other 
hand, show that intercastes consistently have 
higher ILP2 levels in their IPCs, and, given their 
more active ovaries and decreased sensitivity to 
larval signals (25), it is likely that they also have 
consistently higher levels of ILP2 in circulation. 
How the ancestral subsocial cycle was regu- 
lated remains unknown. However, assuming that 
similar mechanisms underlie the O. biroi colony 
cycle, our findings suggest a plausible scenario for 
the evolution of ant sociality. First, during the 
transition from solitary to subsocial life, some 
signaling systems (probably including insulin 
signaling) in adults must have become respon- 
sive to larval signals. This shift allowed behav- 
ioral and physiological responses in adults to 
be appropriately modified for the nutritional 
requirements of the larvae. During the transi- 
tion from subsocial to eusocial life, increased 
developmental variation may have caused some 
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Fig. 3. ILP2 supplementation overrides larval suppression of adult 
reproduction. (A) Workers injected with 100 uM ILP2 in the brood care 
phase activate their ovaries relative to controls injected with 100 uM ILP2 
B chain, despite being in contact with larvae [n = 10, Welch's t test with 
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Fig. 4. Intercastes respond less to larvae and have more ILP2 than 
regular workers. (A) Intercastes have more active ovaries than age- 
matched regular workers in the brood care phase, despite both groups 
being in contact with larvae (n = 16, Welch's t test; ***P < 0.0001). (B) In 


adults to emerge from the pupa with low nu- 
tritional stores and low ILP2 levels. These sub- 
fertile individuals would have been more sensitive 
to larval signals that suppress reproduction and 
would consequently have foregone nest founding 
and ovary activation and instead assumed brood 
care roles. Other adults, meanwhile, would have 
emerged with high nutritional stores and high 
ILP2 levels. These adults would have had reduced 
sensitivity to larval signals and would have been 
more likely to reproduce despite the presence of 
larvae. This reproductive asymmetry could then 
have been enhanced or modified by natural selec- 
tion to ultimately produce the obligately repro- 
ductive queens and sterile workers of advanced 
eusocial species (fig. S14). This scenario consti- 
tutes an explicit molecular version of Mary Jane 
West-Eberhard’s model for the evolution of hyme- 
nopteran eusociality (5). 
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Lectins modulate the microbiota 


of social amoebae 


Christopher Dinh’”, Timothy Farinholt', Shigenori Hirose’, 


Olga Zhuchenko’, Adam Kuspa”?* 


The social amoeba Dictyostelium discoideum maintains a microbiome during multicellular 
development; bacteria are carried in migrating slugs and as endosymbionts within amoebae 
and spores. Bacterial carriage and endosymbiosis are induced by the secreted lectin discoidin | 
that binds bacteria, protects them from extracellular killing, and alters their retention 
within amoebae. This altered handling of bacteria also occurs with bacteria coated by plant 
lectins and leads to DNA transfer from bacteria to amoebae. Thus, lectins alter the cellular 
response of D. discoideum to bacteria to establish the amoebae’s microbiome. Mammalian 
cells can also maintain intracellular bacteria when presented with bacteria coated with lectins, 
so heterologous lectins may induce endosymbiosis in animals. Our results suggest that 
endogenous or environmental lectins may influence microbiome homeostasis across 


eukaryotic phylogeny. 


ictyostelium discoideum are amoebae that 
live in the soil and feed on bacteria; as 
food becomes scarce, they aggregate into 
amound and form a migrating slug, which 
eventually develops into a fruiting body 
containing a spore-filled sorus on a cellular stalk 
(1). It had been thought that development oc- 
curred free of bacteria and that the final fruiting 
body was sterile (2). D. discoideum has an innate 
immune system composed of “sentinel” cells that 
are able to rid the slug of interstitial bacteria 
(3, 4); however, it was recently reported that about 
one-third of wild isolates maintain symbiotic in- 
teractions with the bacteria in their soil envi- 
ronment and carry bacteria stably through cycles 
of growth and development (5). In a form of mi- 
crobial farming, carried bacteria can seed a new 
food supply for germinating spores as the spores 
establish a new colony of amoebae (5). Addition- 
al interactions between D. discoideum and their 
bacterial associates have been documented, and 
there is evidence that they are controlled by 
specific signals (6-8). Our finding that amoebal 
lectins mediate bacterial carriage suggests that 
D. discoideum actively controls its microbiome and 
that amoeba-bacterium interactions may serve as 
a useful model for microbiome homeostasis. 
Amoebae that carry bacteria are easily distin- 
guished from noncarriers because migrating car- 
rier slugs leave behind bacteria that form colonies 
visible to the naked eye (fig. SIA). To assay for 
carriage, we allowed amoebae to initiate develop- 
ment in the presence of food bacteria such as 
Klebsiella pneumoniae (fig. S1B) and scored indi- 
vidual fruiting bodies for the presence of viable 
bacteria (fig. SIC). By this measure, >90% of a 
carrier strain’s sori contained bacteria (table S1) 
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and bacterial endosymbionts in their spores 
(fig. SID), whereas <10% of sori from noncarrier 
strains contained viable bacteria (table S1). It is 
unclear how bacterial carriage occurs, given the 
existence of an innate immune system, although 
significantly reduced sentinel cell populations 
in carrier strains have been reported (9). All of 
our carrier strains had sentinel cells (table S1), 
so we tested whether they cleared bacteria dur- 
ing extended slug migration. We found that car- 
riers maintained their bacteria through >6 cm 
of slug migration, whereas no viable bacteria 
were recovered from noncarriers after slug mi- 
gration, suggesting that sentinel cells do not in- 
fluence bacterial carriage (table S1). 

Because carrier strains maintain bacteria dur- 
ing development, we assessed their ability to 
kill bacteria as they transition from growth to 
development. The noncarrier laboratory strain 
AX4 killed extracellular bacteria within the plaque, 
whereas just outside the plaque, the bacteria 
were alive (Fig. 1A). Plaques of noncarrier strains 
were similar to those of AX4, whereas plaques 
of carriers had a mottled appearance, with live 
and dead bacteria segregated into distinct areas 
near the edges (Fig. 1B). Thus, extracellular bac- 
terial killing within carrier plaques is uneven, 
and this might explain the presence of live bac- 
teria as the amoebae begin to develop. 

We tested extracellular killing of K. pneumoniae 
by amoebae in suspension and found that non- 
carrier strains killed bacteria efficiently, but car- 
rier strains did not (Fig. 1C). We tested whether 
this difference in bacterial killing might be ex- 
plained by the proteins that they secrete during 
growth by examining the secretomes harvested 
from amoebae consuming K. pneumoniae. We 
fractionated the secretomes by ion-exchange 
chromatography and followed antibacterial ac- 
tivity by using an end-point dilution assay that 
measures the killing of K. pnewmoniae (fig. S2A). 
Antibacterial activity in noncarrier secretomes 
eluted from the column in a broad peak and was 


heat-labile (10 min, 65°C), but carriers produced 
very little, or undetectable, activity (Fig. 1D; fig. $2, 
B to D; and table S2). This prompted us to iden- 
tify regulators of bacterial carriage by screening 
for proteins that are differentially secreted by 
carrier versus noncarrier amoebae. The most 
prominent differences between carriers and non- 
carriers were two proteins in carriers with appar- 
ent molecular weights of ~24 and ~30 kDa (fig. 
S3A). We used mass spectrometry to determine 
that the 30-kDa band contained a mixture of the 
well-studied discoidin I lectins discoidin A (DscA) 
and discoidin C (DscC) (table S3) and that the 
24-kDa band contained a mixture of the calcium- 
dependent adhesion protein CadA and the re- 
lated proteins Cad2 and Cad3 (table S4) (10-13). 

Discoidin I forms dumbbell-shaped trimers 
with the subunits oriented in parallel, positioning 
the C-terminal H-type lectin domains at one end 
(14). Discoidin I binds to polysaccharides contain- 
ing N-acetylgalactosamine (GalNAc) and galac- 
tosamine in 61-63 linkages that are found in 
D. discoideum cell surface glycoproteins and bac- 
terial carbohydrates (15, 16). In laboratory strains, 
discoidin I is found inside amoebae, and it is 
secreted when they aggregate after the start of 
development (17, 78). We surveyed the secretion 
of discoidin I in wild strains, using antibodies 
against DscA and DscC, and found that carriers 
appeared to secret higher levels of these proteins 
at the start of development than noncarriers 
(Fig. 2A). One carrier that we examined in detail 
secreted >80 times more discoidin I than non- 
carriers (fig. S3B). We examined the timing of 
discoidin I secretion from the end of vegetative 
growth through developmental aggregation and 
found that carriers secreted higher amounts of 
discoidin I at earlier times compared with non- 
carriers (e.g., Fig. 2B). These results suggest that 
discoidin I secretion before development is a 
physiological feature of carriers that may de- 
termine bacterial carriage. To explore this, we 
first measured bacterial binding and found that 
DscA binds quantitatively to K. pneumoniae bac- 
teria, likely through the GalNAc moieties present 
in the lipopolysaccharide of many K. pneumoniae 
strains (Fig. 3A) (14, 19). DscA displayed saturation 
binding to K. pneumoniae and Escherichia coli 
(Fig. 3B), and we estimated that DscA binds to 
K. pneumoniae with micromolar affinity (associ- 
ation constant K, = 1.2 x 10° M~}) through ~1.4 x 
10° binding sites on each bacterium (supplemen- 
tary materials) (20). About 2.75 x 10° DscA pro- 
teins would be required to form a monolayer 
covering the surface of one K. pnewmoniae bacte- 
rium, assuming that the lectin ends of the roughly 
cylindrical trimers were opposing an idealized 
smooth surface with the shape and volume of 
the bacterium (supplementary materials). Thus, 
additional binding sites within the glycocalyx of 
K. pneumoniae likely account for the fivefold 
higher DscA binding that we observed. 

We tested whether discoidin I binding alters 
K. pneumoniae’s interaction with amoebae by 
examining bacterial killing, carriage, and endo- 
symbiosis. First, we tested whether discoidin I 
protects bacteria from killing by D. discoideum 
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antibacterial proteins (Dabs). DscA-bound 
K. pneumoniae were protected from killing by 
Dabs in an overnight outgrowth assay (fig. S2F) 
and in a short-term viability assay (Fig. 3C). Nota- 
bly, the protection in the outgrowth assay dimin- 
ished when the amount of discoidin added was 
below that needed to achieve saturation binding 
to the bacteria (fig. S2F). However, DscA-coated 
K. pneumoniae were not resistant to killing 
by heat (65°C, 10 min) or antibiotic treatment 
(kanamycin at 50 ug/ml). If protection by dis- 
coidin I is a mechanism by which carrier strains 
spare bacteria from killing, carriers would have 
to secrete sufficient discoidin to protect some 
bacteria as the amoebae enter development. 
Thus, we reexamined carrier secretomes for 
cryptic Dabs that might have been masked by 
discoidin I. Removing discoidin I from carrier se- 
cretomes by passing them through polygalactose 
affinity columns revealed latent antibacterial 
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activity (Fig. 1D, fig. S2E, and table $2). Our re- 
sults indicate that discoidin I can counteract 
the Dabs secreted by carriers, at least in vitro, and 
this suggests that discoidin I is secreted by carrier 
strains at levels sufficient to spare some bacteria 
for carriage as amoebae enter development. 

We tested whether discoidin I influences the 
bacterial carriage and endosymbiosis that define 
the carrier phenotype. We mixed D. discoideum 
amoebae with K. pnewmoniae or with DscA- 
coated K. pneumoniae at the start of development 
and tested the resulting fruiting-body sori for live 
bacteria. A single treatment of K. pnewmoniae 
with DscA resulted in bacterial carriage by the 
noncarrier strain QS4 and the noncarrier labo- 
ratory strain AX4 (table S1). We also observed 
live green fluorescent protein (GFP)-expressing 
K. pneumoniae within the D. discoideum spores, 
but only when the bacteria were pretreated with 
DscA (Fig. 4A). These results suggest that ex- 


QS69 (car) 


QS70 (car) 


tracellular discoidin I is sufficient to account for 
bacterial carriage in D. discoideum by binding 
bacteria, protecting them from secreted anti- 
bacterial proteins, and inducing carriage and 
endosymbiosis. 

Some of the internalized discoidin-coated bac- 
teria escape intracellular killing and digestion 
by the phagolysosomal pathway of the amoebae 
and end up in the spores. Thus, discoidin I al- 
ters the handling of bacteria by amoebae so that 
live bacteria persist, a phenomenon that we 
have termed lectin-induced modified bacterial 
internalization, or LIMBI. We demonstrated the 
specificity of LIMBI for lectin-coated bacteria 
with mixing experiments using discoidin-coated 
bacteria and uncoated bacteria. We presented 
DscA-bound red fluorescent protein (RFP)- 
expressing E. coli and GFP-expressing E. coli 
together to D. discoideum and followed their 
uptake by the amoebae over several hours by 


QS74 (car*) QS80x(car*) 


500 —gg— QS23 (non-carrier) eeeneos 
—®— QS37 (carrier) 
--O--- QS37 (Dsc removed) 


0 30 60 90 
time (min) 


Fig. 1. Extracellular killing of bacteria by D. discoideum and secreted 
antibacterial proteins. (A) Plaques of D. discoideum strain AX4 growing 
on lawns of K. pneumoniae were stained with fluorescent dyes that 
distinguish live and dead bacteria, as visualized by fluorescence micros- 
copy (upper left panel) and differential interference contrast (DIC) 
microscopy (lower left panel). Numbers (1 and 2) indicate the areas that 
were sampled to reveal dead (red) and live (green) bacteria (right panels). 
(B) Plaques of wild carrier (car*) and noncarrier (car_) strains, stained 
as in (A). (C) Carrier (circles) and noncarrier (Squares and diamonds) 
amoebae were mixed with K. pneumoniae in buffer, and the number of 
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fraction number 


viable bacteria were determined over time and normalized to 100% at the 
start of the assay. Values are means + SEM for three independent 
experiments. (D) Secretomes from QS23 (noncarrier) and QS37 (carrier) 
were resolved by ion-exchange chromatography (DEAE-Sepharose) 

with a step-elution of increasing salt (NaCl, dashed line), and the fractions 
(1 ml) were assayed for antibacterial activity against K. pneumoniae. Trace 
activity was sometimes observed in carrier secretomes and elutes near 
the peak observed in noncarriers (asterisk). When discoidin | is removed 
from carrier secretomes, antibacterial activity becomes apparent 

(open circles). 
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Fig. 2. Discoidin | secretion by wild carrier 
strains. Wild strains were harvested during 
late-stage growth on lawns of K. pneumoniae 
bacteria (when most of the bacteria had been 
consumed), shaken in Sor buffer for 1 hour 
before being separated into supernatant (S) 
and pellet (P) fractions, and resolved on 
SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE) protein gels. (A) Western blots 
stained with antibodies against discoidin I, 
showing soluble and cell-associated discoidin | 
in carrier and noncarrier strains. The DscA/C 
control (left lane) was purified from AX4 
(supplementary materials). (B) A representative 
time course of discoidin | production, from 
before bacteria are consumed to the start of 
the growth-to-development transition, for a 
noncarrier (QS14) and a carrier (QS68) strain. 
The 3-hour point is roughly equivalent to the 
time of harvest in (A). 


Fig. 3. Discoidin | binds and protects bacteria. 
(A) Increasing numbers of K. pneumoniae (K.p.) 
bacteria (1X = 2.3 x 10’ bacteria) were incubated 
with purified DscA (300 ug/ml) in 100 ul buffer for 
60 min at room temperature. Bacteria were 
separated from soluble DscA by centrifugation, 
and the supernatant (S) and pellet (P) fractions 
were resolved by SDS-PAGE and visualized by 
Coomassie blue staining. (B) DscA (12.5 to 

62.5 ug) was incubated with 1.3 x 10° K. pneumoniae 
or E. coli in 100 ul for 60 min. Bacteria were 
removed by centrifugation, and the amount of 


A & QS4 =QS14. QS18 = QS31 QS35 QS38 QS68 QS70 
s car car car* car* car car car* —car* 
a Ss P S P S PS _ P S PS P S P S P 

kDa - 

43= 


unbound DscA was determined and used to calculate the amount of DscA bound to the 
bacteria (shown as means + SEM). The asterisk indicates that no unbound DscA was 
detected. The dashed line shows the saturation point estimate. (C) K. pneumoniae 
bacteria, pretreated with DscA or bovine serum albumin (BSA), were suspended in buffer 
and challenged with Dabs. Bacterial viability was quantitated by counting live bacteria 


at various times, using a live-dead staining reagent 
normalized to the zero-time samples. Statistically s 


***P < 0.001) were observed between DscA-treated and BSA-treated bacteria at all time 


points (one-way pairwise analysis of variance using 
comparisons of means). Values are means + SE 


means of time-lapsed fluorescence imaging. 
Amoebae took up and digested uncoated bac- 
teria, whereas some discoidin-coated bacteria 
remained intact and were scattered throughout 
the cytoplasm. The reciprocal experiment with 
discoidin-coated GFP-expressing E. coli resulted in 
retention of only those bacteria. The differential 
handling within the same amoeba of discoidin- 
coated and uncoated bacteria indicates that 
discoidin must be bound to the bacterium for 
it to persist after internalization. We quantified 
the persistence of bacteria after LIMBI by mea- 
suring the length of time that GFP-labeled bac- 
teria remained intact within vegetative amoebae, 
using time-lapse imaging in an under-agar assay 
to create a pseudo-two-dimensional view (Fig. 4B) 
(21). By following bacteria from the time that they 
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were internalized by amoebae to the time that 
they lost their structural integrity, we found that 
discoidin-coated bacteria persisted nearly seven 
times longer than uncoated bacteria (Fig. 4C). 
LIMBI may provide an opportunity for bac- 
terial DNA transfer into the host nucleus, and 
we examined this by inducing LIMBI of DscA- 
coated E. coli harboring a shuttle plasmid and 
selecting for drug-resistant amoebae. Each of the 
discoidin-coated E. coli strains introduced into 
D. discoideum by LIMBI led to genetic trans- 
formation of the amoebae at high frequencies 
(~2 to 20%), including those carrying plasmids 
with G418 (neo’) or blasticidin (bs”) resistance 
genes (table S5). To visualize LIMBI transforma- 
tion, we used plasmids that express D. discoideum 
histone H2b fused to the RFP mCherry and 


0 10 20 30 40 50 60 70 


Firman [minittac\ 


observed that all of the transformed amoebae 
expressed the mCherry-tagged histone within 
their nuclei (Fig. 4D and table S5). Immunoblots 
of proteins from these cells confirmed that the 
fusion protein was expressed (fig. S4A). Somewhat 
unexpectedly, ~10% of the amoebae expressed 
RFP-H2b 24 hours after LIMBI without selection 
for drug resistance, suggesting that transient 
expression also occurs (table S5). These results 
indicate that DNA can be transferred from the 
bacteria into the nucleus of the amoebae, imply- 
ing that LIMBI is functionally distinct from the 
phagolysosomal digestion pathway that amoebae 
use to process bacteria as food. 

We tested the generality of LIMBI in eukary- 
otes by testing plant lectins as elicitors and mam- 


malian cells as recipients. We identified several 
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A feSicinka 
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DAPI 
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Fig. 4. Lectin-induced modified bacterial internalization (LIMBI) results in bacterial endo- 
symbiosis, increased persistence of bacteria within amoebae, and genetic transformation. 
(A) D. discoideum amoebae (QS4) were mixed with GFP-expressing K. pneumoniae that had been 
pretreated with buffer (mock) or DscA and allowed to develop into fruiting bodies. Fluorescence 
microscopy of the resulting spores shows intact bacteria in spores after LIMBI with DscA. (B) An 
overlay image of DIC and fluorescence microscopy, showing vegetative amoebae and GFP- 
expressing and DscA-coated E. coli under agar. The persistence of the bacteria within amoebae 
was quantified by determining the number of minutes that each bacterium was observed to remain 


intact. (C) Box plots of the persistence time of di 
LIMBI. The edges of the boxes represent the 75th 
medians, the whiskers are the minimums and ma 
between LIMBI and the no-lectin control was stat 


scoidin-coated bacteria within amoebae after 
and 25th percentiles, thick lines represent 

ximums, and dots are outliers. The difference 
istically significant (P < 0.001, Mann-Whitney- 


Wilcoxon rank-sum test). (D) LIMBl-mediated gene transfer from E. coli into D. discoideum. 
Discoidin-coated E. coli harboring a plasmid engineered to express a red fluorescent protein 
fusion to D. discoideum histone H2b (mCherry-H2b) were mixed with AX4 amoebae, followed 
by 10 days of drug selection for the plasmid. Shown are a DIC microscopy image, fluorescence 


microscopy images of nuclear DNA stained with DAPI (4’,6-diamidino-2-phenylindole) or mCherry, 
and an overlay showing nuclear expression of mCherry-H2b. 


plant-derived lectins that bound to E£. coli, in- 
cluding Wisteria floribunda agglutinin, Dolichos 
biflorus agglutinin, soybean agglutinin (SBA), and 
concanavalin A (ConA) (22). We mixed lectin- 
coated bacteria with D. discoideum, using the 
same protocols developed with discoidin I, and 
found that each of the plant lectins appeared 
to be equally effective for LIMBI and LIMBI- 
mediated transformation (fig. S4, B and C, and 
table S6). We also introduced lectin-coated E. coli 
to mammalian cells in culture and found that 
DscA, ConA, and SBA were each equally effective 
for LIMBI and LIMBI-mediated transformation 
of mouse RAW264 macrophages (tables S5 and 
S6). Competition experiments with admixtures 
of GFP- and RFP-expressing bacteria showed 
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that only lectin-coated bacteria persisted intact 
within RAW264 macrophages after overnight 
incubation (fig. $5). Our results show that LIMBI 
can occur in amoebae and in animal cells, using 
amoebal or plant lectins, suggesting that modi- 
fied uptake and persistence of lectin-coated bac- 
teria in eukaryotic cells is a general phenomena. 

We have described physiological differences 
between D. discoideum carrier and noncarrier 
strains that appear to account for the salient 
features of bacterial carriage during social amoe- 
bae development. Because discoidin I is suf- 
ficient to induce carriage by noncarrier strains, 
its secretion by carrier strains provides a plausible 
mechanism for the transition of carrier amoebae 
from active bacterial killing and feeding to star- 


vation and bacterial symbiosis. Prestarvation factor 
(PSF) is a ~’70-kDa secreted protein that reports 
on the bacterial food supply at the growth-to- 
development transition (23, 24). PSF is seques- 
tered by bacteria when bacterium/amoeba ratios 
are high, but as the bacteria are depleted by 
amoebal feeding, the level of free PSF rises and 
stimulates D. discoidewm development through 
autocrine signaling. PSF has never been identi- 
fied, and because discoidin I binds bacteria and 
affects the kill-versus-carriage dynamics of amoe- 
bae, we are testing whether discoidin I is PSF. 
Discoidin I’s H-type lectin domains are on the 
C termini of the trimer subunits, but the N-terminal 
discoidin domains have distinctive structural sim- 
ilarities to F-type lectins and have been proposed 
to bind fucose and sialic acid containing N- or 
O-glycans (14). If this is true, it is possible that 
the H-type lectin domains of discoidin I bind 
bacteria, leaving the discoidin domains free 
to interact with N- or O-linked glycans on the 
cell surface of D. discoideum amoebae to ini- 
tiate LIMBI or engage in receptor-mediated sig- 
naling. If the uptake mechanism does involve 
amoebal glycoproteins as receptors, plant lectins 
might act as bacteria-bound ligands for those 
same receptors. 

Lectinophagocytosis results in efficient bac- 
terial uptake by eukaryotic cells through spe- 
cific receptor recognition of lectins provided by 
the bacteria or the host cell (25, 26). Plant lectins 
can also mediate macrophage lectinophagocy- 
tosis (27, 28), and in some cases this is concomi- 
tant with the suppression of bacterial killing 
(29). It is well known that lectins target bacteria 
for destruction by the innate immune system of 
the host, but there are reports of lectins with 
functions unrelated to defense (30, 31). If lectin 
protection of bacteria within hosts proves to be 
a general feature of multicellular eukaryotes, it 
would provide a new perspective on the regula- 
tion of microbiomes in well-studied systems, in- 
cluding humans. For example, LIMBI may promote 
the maintenance of intracellular bacteria in the 
mucosal epithelium or myofibroblasts of the colon 
and contribute to inflammatory disease processes. 
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GUT INFLAMMATION 


LKBI deficiency in T cells 
promotes the development 
of gastrointestinal polyposis 


M. C. Poffenberger”’”, A. Metcalfe-Roach’, E. Aguilar’?, J. Chen’”, B. E. Hsu™*, 

A. H. Wong?”, R. M. Johnson’, B. Flynn”, B. Samborska’, E. H. Ma’, S.-P. Gravel”, 
L. Tonelli’, L. Devorkin®, P. Kim®, A. Hall””, S. Izreig’”, E. Loginicheva®, 

N. Beauchemin”®, P. M. Siegel”*, M. N. Artyomov®”°, J. J. Lum®™, G. Zogopoulos””, 


J. Blagih””, R. G. Jones?”??* 


Germline mutations in STK11, which encodes the tumor suppressor liver kinase B1 
(LKB1), promote Peutz—Jeghers syndrome (PJS), a cancer predisposition syndrome 
characterized by the development of gastrointestinal (GI) polyps. Here, we report 
that heterozygous deletion of Stk11 in T cells (LT"*t mice) is sufficient to promote Gl 
polyposis. Polyps from LT"* mice, Stk11*’~ mice, and human PJS patients display 
hallmarks of chronic inflammation, marked by inflammatory immune-cell infiltration, 
signal transducer and activator of transcription 3 (STAT3) activation, and increased 
expression of inflammatory factors associated with cancer progression [interleukin 6 
(IL-6), IL-11, and CXCL2]. Targeting either T cells, IL-6, or STAT3 signaling reduced 
polyp growth in Stk11*”~ animals. Our results identify LKB1-mediated inflammation 
as a tissue-extrinsic regulator of intestinal polyposis in PJS, suggesting possible 
therapeutic approaches by targeting deregulated inflammation in this disease. 


iver kinase B1 (LKB1) is a multifaceted 
serine/threonine kinase with roles in me- 
tabolism, cell polarity, cell size control, and 
proliferation (J). Somatic mutations in 
STKII are associated with several human 
cancers, including lung cancer (where it is fre- 
quently comutated with KRAS) and gynecological 
cancers (ovary, uterus, and cervix) (2). Heterozy- 
gous germline mutations in STK1/ predispose 
both humans and mice to the development of 
Peutz-Jeghers syndrome (PJS) (3-7), an auto- 
somal dominant hereditary disease characterized 
by the development of benign gastrointestinal 
(GI) hamartomatous polyps (8). PJS patients also 
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have a high cumulative cancer risk, with >90% 
chance of developing tumors by age 65 (9). 

Much of our understanding of PJS devel- 
opment and pathogenesis has been derived 
from Stk1I*/- mice, which develop GI polyps 
histologically similar to those found in PJS pa- 
tients, with an average latency of 9 to 12 months 
(5-7). Deletion of Stk17 in smooth-muscle cells 
promotes GI polyp formation with lower pene- 
trance than that in whole-body Stk1I*”" mice 
(~60% versus 85 to 100%) (10). We used a Villin- 
Cre transgene to delete Stk in intestinal epithe- 
lial cells (IECs) of the small intestine and colon. 
We found no evidence of GI polyps in mice het- 
erozygous (Stk1I!*;Villin-Cre) for Stk11 in IECs, 
even those >52 weeks of age (table S1), which 
argues for additional nonepithelial functions 
for LKB1 that contribute to disease development in 
Stkil’”- mice and PJS patients. 

LKBI1 plays an essential role in regulating 
hematopoietic stem-cell metabolism and survi- 
val (11-13). Unexpectedly, we found that aged 
mice (>52 weeks) with heterozygous deletion 
of Stk1I throughout the hematopoietic system 
(LH: stkir/*;Vav-iCre) developed GI polyps 
at the junction of the small intestine and stomach 
reminiscent of PJS polyps (Fig. 1A). Polyps ob- 
served in LH®* mice displayed an arborizing 
smooth-muscle core and were histologically indis- 
tinguishable from polyps observed in Stk1I*/- 
mice (Fig. 1A). We next generated mice harbor- 
ing heterozygous or homozygous Stk11 mutations 
in either the T cell (LT"*: Stk11"/*:Lck-Cre and 
LT": stkiz"/";Lck-Cre) or B cell (LB‘°: Stkir"/"; 
Cd19-Cre) compartments. LT"* mice developed 
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GI polyps at an incidence similar to that of LH®* 
mice, and homozygous deletion of Stk1/ in T cells 
(LT*°) doubled the incidence of polyp formation 
(Fig. 1, B and C, and table $2). LB*° mice did not 
develop disease (table S1). A subset of male Lr 
and LT*° mice also displayed liver lesions and 
increased rates of hepatocellular carcinoma (HCC) 
relative to those of control animals (table $3), a 
phenotype often observed in Stk1I*/- mice (14). 
Genotyping and immunoblot analysis confirmed 
reduced LKB1 expression in T cells but not liver 
tissue of LT®* mice (fig. S1). Thus, T cell-specific 
loss of Stk11 is sufficient to promote intestinal 
polyposis. 

Many of LKB1’s functions have been attributed 
to its stimulation of adenosine 5'-monophosphate 
(AMP)-dependent protein kinase (AMPK)- 
dependent stress responses (2). However, mice 
with T cell-specific (AT: Prkaat™” 1. Cq4-Cre) or 
hematopoietic cell-specific (AH*°: Prkaar", 
Vav-iCre) loss of AMPKa1 did not develop GI 
polyps (table S1). GI polyps from LT®* and LT®° 
mice displayed increased mammalian target of 
rapamycin complex 1 (mTORC1) signaling (fig. 
$2A), which is characteristic of polyps from 
Stkir*/~ mice (15). However, mice with condi- 
tional deletion of both LKB1 and mTOR in T cells 
(LT®°:T-Frap7": StkiP".Frap":Lck-Cre*) still 
developed GI polyps (fig. S2B). Thus, AMPK or 
mTOR pathway activity in T cells does not ap- 
pear to contribute to PJS polyp development in 
these mouse models. 

Histological analysis of PJS polyps revealed 
evidence of prominent immune cell infiltration 
in polyps from whole-body Stk1z*/~, LT", and 
LT®° mice (Fig. 2A and fig. S3A). T cells (CD4* 
and CD8*), macrophages, and neutrophils were 
diffusely present throughout the polyps, with 
lymphocytes also organizing into concentrated 
centers in the smooth-muscle core of the polyps 
(Fig. 2A and fig. S3, A to C). Histological analysis 
of polyps from two independent, unrelated PJS 
patients revealed clustering of T cells in the 
smooth-muscle core and stroma of the polyps 
(Fig. 2B), similar to our observations in mouse 
samples. To assess the impact of lymphocytes 
on disease progression, we crossed Stk1I*/- mice 
onto a Rag2-deficient background (which lack 
mature lymphocytes because of a block in devel- 
opment). Although Stk1I*/";Rag2~”~ mice still 
developed polyps, polyp size was significantly 
decreased in Stk1I*/- mice lacking lymphocytes 
(Fig. 2, C and D). 

We next assessed inflammatory cytokine ex- 
pression in polyps and adjacent nonpolyp tissue 
from aged control and LT®™ animals (>52 weeks 
of age) (Fig. 2E). Polyps from diseased LT" mice 
consistently expressed increased levels of inflam- 
matory cytokines—including /110, 16, [111, and 
Tnfa—compared with GI sections from control 
animals or LT?“ mice that did not develop dis- 
ease (Fig. 2E). Notable in this profile were high 
levels of 16 and Jl11, cytokines that are asso- 
ciated with chronic gastric inflammation and GI 
tumor development (J6-18). Similar changes in 
inflammatory cytokine gene expression were ob- 
served in polyps from Stk1I*/~ mice (Fig. 2F). The 
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expression of Cxcl2 (MIP-2), a proinflammatory 
chemokine that directs immune-cell recruitment, 
was increased in both polyps and adjacent non- 
polyp tissue from LT" and StkiI*’" mice (Fig. 2, 
G and H). Profiling immune-cell subsets from 
GI tissue of control or polyp-bearing LT®* mice 
by means of flow cytometry revealed the in- 
creased presence of inflammatory monocytes 
(CD11b*Gri™*) and neutrophils (CD11b*Gr1"™) 
in polyps of mice heterozygous for LKB1 in T cells 
(Fig. 21). 

To test whether inflammation could influence 
PJS polyp formation, we subjected young con- 
trol and StkiI*/- mice to sublethal irradiation 
[4.5 grays (Gy)], which stimulates acute muco- 
sitis and inflammatory cytokine production in 
the GI tract (19). All Stk1I*/~ mice that received 
radiation developed polyps within 16 weeks of 
treatment, whereas age-matched nonirradiated 
StkiI*/- mice did not develop visible polyps 
during this time (Fig. 2, J and K). Thus, immune 
infiltration and inflammatory cytokines are hall- 
marks of PJS polyps, and damage-associated 
inflammation can initiate polyp formation in 
genetically susceptible Stk1I*/~ mice. 

We next investigated how LKB1 loss in T cells 
could account for the proinflammatory nature 
of PJS polyps. RNA sequencing (RNA-seq) analy- 
sis of in vitro activated control and LT*° CDs* 
T cells revealed enrichment in several pathways 
central to T cell activation (cytokines/chemokines, 
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ribosome biogenesis, and phosphatidylinositol 
3-kinase signaling) in LT®° T cells (fig. S4). Among 
the pathways specifically enriched in activated 
LT® CD8* T cells were Janus kinase (JAK)- 
signal transducers and activators of transcription 
(STAT) signaling and cytokine-cytokine receptor 
interactions (figs. S5 and S6), the former scoring 
highly owing to increased cytokine mRNA ex- 
pression in activated LT T cells. The expression 
of several of these cytokines (/fng, 119, and 1124) 
in activated LT" and LT’? CD8* and CD4** T cells 
was confirmed with quantitative polymerase 
chain reaction (fig. $7, A and B). Stk11-deficient 
CD4* and CD8* T cells activated under non- 
polarizing conditions displayed increased secre- 
tion of many of the inflammatory factors observed 
in PJS polyps, including CXCL2, CXCL9, and 
interleukin 6 (IL-6) (Fig. 3, A and B). The acti- 
vation of naive LT"* or LT*° CD4* T cells led 
to the increased generation of interferon-y 
(IFN-y)- and IL-17A-producing CD4* T cells under 
T helper 1 (Ty1)- and Ty17-polarizing conditions, 
respectively (Fig. 3, C to E). LKB1-deficient T},17 
cells also displayed increased IL-6 and tumor necro- 
sis factor-o, (TNF-o.) production (Fig. 3, E and F). 
LKB!1 deletion in T cells leads to deregulated 
metabolism and increased IFN-y production by 
CD4* and CD8* T cells (20). We observed pro- 
gressive deregulation of T cell homeostasis based 
on Stk1I gene status, marked by an increased 
frequency of activated CD4* and CD8* T cells 
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Fig. 1. T cell-specific deletion of LKB1 promotes intestinal polyposis. (A) Representative 
hematoxylin and eosin (H&E)—stained sections of stomachs (S), small intestines (D), and 

polyps (P) from Stk11*’* (n = 84), Stk11*”” (n = 64), LH™* (n = 28), and LH* (n = 38) mice 

aged >52 weeks. Images in the right column are a higher magnification of the boxed region 
indicated in the left column. Scale bars, 1 mm. (B) Representative H&E-stained sections 

of stomach (S), small intestine (D), polyps (P), liver, and liver nodules (N) from LT™* (n = 102), 
LThet (n = 64), and LT*° (n = 50) mice aged >52 weeks. Images in the center column are a higher 
magnification of the boxed region in the left column. Scale bars, 1 mm. (C) Polyp size in Stk11*’~ 
(n = 16), LT'* (n = 10), and LT*° (n = 9) mouse strains aged >52 weeks. Data are represented 


as the mean + SEM. *P < 0.05 compared with control samples (Stk1 


1*”~ mice), one-way analysis 


of variance (ANOVA), Dunnett’s multiple comparisons test. 
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(CD44") in LT®* and LT®° mice (fig. $8, A and 
B). Increased proportions of activated IFN-y- 
and IL-17A-producing T cells, as well as IL-6- 
producing CD4* T cells, were detected in the 
GI-draining mesenteric lymph nodes (mLN) of 
LT mice (Fig. 3G and fig. S8C). Increased fre- 
quency of polyfunctional (IFN-y*TNF-o"*) inflam- 
matory T cells was observed in polyps isolated 
from LT®* and LT* animals (Fig. 3H). The ex- 
pansion of inflammatory CD4* Ty cells in LT*° 
mice was likely not due to defects in regulatory 
T cell (T,¢g cell) populations as control, LT 
and LT*° mice displayed similar levels of CD4 
*Foxp3* Treg cells (fig. S9, A and B), and we ob- 
served no difference in suppressive capacity be- 
tween control and LKB1-deficient CD4* Tyeg cells 
(fig. S9, C and D). 

We next tested whether Stk1/-mutant T cells 
could modulate inflammation in surrounding 
tissues. Supernatants from Stk17 mutant T cells 
stimulated a fourfold increase in //6 expression 
in nontransformed mouse embryonic fibroblasts 
(MEFs) compared with supernatants from wild- 
type T cells (Fig. 31). Stki77/~ MEFs secreted IL-6 
in the absence of inflammatory stimuli and pro- 
duced significantly more CXCL2 and IL-6 when 
stimulated with lipopolysaccharide (LPS) (Fig. 3J 
and fig. SIOA). Deregulated IL-6 production by 
Stk1/~ MEFs was not limited to LPS treatment 
because other Toll-like receptor (TLR) ligands 
strongly induced IL-6 production in LKB1- 
deficient MEFs (Fig. 3J). IL-6 production by 
LKB1-deficient MEFs was insensitive to mTORC1 
inhibition by use of rapamycin (fig. S10B). Thus, 
in addition to aberrant immune-cell function, 
Stk11 mutations in nonimmune tissues (epithe- 
lial and/or stromal cells) in PJS patients may 
confer increased responsiveness to inflammatory 
stimuli. 

In inflammation-driven cancers such as gas- 
tric and colon cancer, tumor progression is driv- 
en by chronic STATS activation downstream of 
inflammatory cytokines such as IL-6 and IL-11 
(21). Consistent with the abundance of STAT3- 
activating cytokines in the GI microenvironment, 
polyps from LT®*, LT*°, and Stk1I*/" mice dis- 
played elevated phospho-STAT3 (Y705) stain- 
ing compared with similar sections from LT 
and Stki1*/* mice (Fig. 4A). A similar distribution 
of phospho-STAT3 (Y705) staining was observed 
in the stromal compartment of polyps from PJS 
patients (Fig. 4B). Both polyps and adjacent non- 
polyp tissue from diseased mice exhibited in- 
creased STAT3 phosphorylation compared with 
that of nondiseased tissue from control mice 
(Fig. 4, C and D). Expression of several STAT3 
target genes, including CendI and Socs3, was 
increased in polyps (Fig. 4E and fig. S11A). Ad- 
ditionally, the STAT3 target gene 77r2, which 
encodes a pattern-recognition receptor poten- 
tiating inflammatory responses to bacterial pep- 
tidoglycans, was significantly increased in the 
polyps of LT®* and Stk1I*/~ mice (Fig. 4F). In- 
creased TLR2 expression and polymorphisms 
in the TLR2 gene are associated with gastric 
cancer (22, 23). Conditioned medium from ac- 
tivated LKB1-mutant (LT and LT*’) T cells was 
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Fig. 2. Immune-cell infiltration and expression of proinflammatory 
genes are hallmarks of PJS polyps. (A and B) Representative H&E-, CD3- 
(T cell) and Iba-1- (macrophage) stained sections of stomach, small intestine, 
and polyps from (A) mice aged >52 weeks (n = 3 for each genotype) and 

(B) PJS patients (n = 2). Scale bars, 100 um. (C) Representative H&E staining 
of GI sections from Stk1I*”~ (n = 18) or Stk1l*”;Rag2“- (n = 15) mice aged 
>52 weeks. Scale bars, 5 mm. (D) Polyp size for animals in (C). Data are 
represented as the mean + SEM. *P < 0.05 compared with Stk11*”~ mice, 
unpaired Student's t test. (E) Normalized relative mRNA levels of inflammatory 
cytokines ((I1b, II6, II11, and Tnfa) in GI tissue from LT (WT) (n = 5) or 
nondiseased (ND) LT"*t mice (n = 4), or polyps (P) (n = 4) and adjacent 
nonpolyp tissue (NP) (n = 4) from diseased LT"*t mice aged >52 weeks. Gene 
expression relative to Tbp mRNA levels. Data are represented as the mean + SEM. 
*P < 0.05 compared with LT™ mice, one-way ANOVA, Dunnett's multiple 
comparisons test. (F) Normalized relative cytokine mRNA levels in GI tissue 
from Stk11*”* (WT) (n = 16) or polyps (P) (n = 11) and adjacent nonpolyp tissue 
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(NP) (n = 12) from Stk1I*”~ mice aged >52 weeks. Gene expression relative 
to Tbp mRNA levels. Data are represented as the mean + SEM. *P < 0.05 
compared with Stk11*”* mice, one-way ANOVA, Dunnett's multiple compar- 
isons test. (G and H) Cxcl2 mRNA levels in LT"* (n = 4 or 5) and Stk11*”~ 
(n= 6 to 13) mice aged >52 weeks as in (E) and (F). Gene expression relative to 
Tbp mRNA levels. Data are represented as the mean + SEM. *P < 0.05 
compared with LT™* or Stk11*”* mice, one-way ANOVA, Dunnett's multiple 
comparisons test. (I) Representative flow cytometric plots for frequency of 
CD11b* and Gr1* neutrophils and inflammatory monocytes isolated from Gl 
tissue from control (LT) (n = 13) or diseased LT" (n = 13) mice (CD45* 
population). (J) Representative H&E staining of stomach (S), small intestine 
(D), and polyp (P) sections from Stk11*”* (n = 5) and Stk11*“" (n = 7) mice 

16 weeks after irradiation (4.5 Gy). Scale bars, 1 mm. (K) Size of polyps in 
Stk1l*”~ mice 16 weeks after irradiation as in (J). Data are represented as 

the mean + SEM and are representative of two separate experiments. 

*P < (0.05 compared with nonirradiated Stk11*”- mice, unpaired Student's t test. 
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Fig. 3. Deregulated cytokine production and stimulation of inflamma- 
tion by LKB1-deficient T cells. (A and B) Heatmap of cytokine production 
by activated unpolarized LT*t (n = 6), LTN* (n = 7), and LT’° (n = 6) 

(A) CD4* and (B) CD8* T cells as measured with MILLIPLEX assay. Shown 


are cytokines with significant (P < 0.05) up-regulation in LT 
(C) Representative flow cytometry plots for IL-17 and IFN-y 


T° T cells. 
production by 


LT LTP! and LT" CD4* T cells cultured under Ty1- and T, 


H17-polarizing 


conditions. Values displayed represent average percent + SEM for biological 
replicates (n = 2 or 3). Data are representative of three separate experiments. 
(D and E) Enzyme-linked immunosorbent assay for IFN-y production by 

Tyl cells (D) or IL-17 and IL-6 production by T,17 cells (E) of the indicated 
genotypes generated as in (C). Data represent the mean + SEM for technical 
triplicates. *P < 0.05 compared with LT” control samples, one-way ANOVA, 
Dunnett's multiple comparisons test. (F) Representative flow cytometric 
plots showing frequency of IL-17 and TNF-a production by LT’, LTh*t, and LT" 
CD4* T cells cultured under T,,17-polarizing conditions as in (C). Values 
displayed represent average percent + SEM for biological replicates (n = 2 
or 3). (@) Number of IFN-y-, IL-17-, and IL-6-producing CD4* and CD8* 
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T cells in the mesenteric lymph nodes (mLN) of LTwt (n = 19 to 29), LTM* 
(n = 11 to 15), and LT*° (n = 5 to 12) mice (>52 weeks of age). Cytokine 
production by CD4* and CD8* Tcells was determined by means of intracellular 
staining after restimulation of T cells directly ex vivo. *P < 0.05 compared 
with LT control samples, one-way ANOVA, Dunnett's multiple comparisons 
test. (H) Representative flow cytometric plots for frequency of TNF-a— and 
IFN-y—producing CD4* and CD8* T cells isolated from healthy Gl tissue from 
control mice (LT) (n = 13) or polyp-bearing LT" (n = 13) or LT®° (n = 2) mice. 
(I) IG mRNA levels in MEFs after culture (24 hours) with supernatants from 
activated LT™, LT'*t or LT° CD8* Tcells (n = 3 mice for each genotype). 
Untreated MEFs cultured with regular medium are shown. Gene expression 
relative to Tbp mRNA levels. *P < 0.05 compared with control samples (LT), 
one-way ANOVA, Dunnett's multiple comparisons test. (J) IL-6 production 

by Stk11*”* or Stk1l-”~ MEFs after a 6-hour stimulation with LPS from 
Escherichia coli (LPS-EK), heat-killed Helicobacter pylori (HKHP), Pam3CSK4, 
FSL-1, or sham treatment. Data are represented as the mean + SEM for 
biological duplicates. Data are representative of two separate experiments. 
*P < 0.05 compared with Stk11*”* MEF samples, unpaired Student's t test. 
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Fig. 4. LKB1-deficient T cells promote STAT3 activation and expres- 
sion of STAT3-dependent growth-promoting and inflammatory genes 
in PJS polyps. (A and B) Representative immunohistochemistry (IHC) 
for P-STAT3 (Y705) staining in healthy tissue from control animals (LT™* 
and Stk11*”*) (n = 10 and 8, respectively) and polyps from (A) LT" 

(n = 11), LT*° (n = 12), and Stk11*”” (n = 8) mice aged >52 weeks or (B) PJS 
patients (n = 2). Scale bars, 30 um. (C and D) Immunoblot for STAT3 
activation (pY705 and pS727) in control (LT and Stk11*“*), nondiseased 
LT"*t (ND), polyp (P), and adjacent healthy tissue (NP) from (C) LTh* 
and (D) Stk11*”” mice aged >52 weeks. (E and F) Relative mRNA 
expression for STAT3 target genes Ccnd1 (E) and Tir2 (F) in (left) LT** 
(n = 4 or 5) and (right) Stk11*”” mice (n = 11 to 16) aged >52 weeks. WT 
refers to paired control animals (LT or Stk11*”*) for each experiment. 
(G) Immunoblot for STAT3 activation in MEFs treated for 6 hours with IL-6 
or supernatants from activated LT*, LT"*t, or LT*° CD8* T cells. Cells 
were treated with the JAK2 inhibitor AG490 for 1 hour before conditioned 
media treatment where indicated. Data are representative of three 
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separate experiments. (H) T/r2 gene expression in MEFs after culture 

(24 hours) with supernatants from activated LT™*, LT"*t, or LT*° T cells 

(n = 3 mice per genotype). Gene expression is relative to Tbp mRNA 
levels. Data represent the mean + SEM. *P < 0.05 compared with control 
samples (LT), one-way ANOVA, Dunnett’s multiple comparisons test. 

(I) Representative H&E staining of GI sections from Stk11*”~ (n = 9) 

or Stk11*”:II6-“~ (n = 20) mice aged >52 weeks. Scale bars, 5 mm. 

(J) Polyp size for animals in (1). Data represent the mean + SEM. *P < 0.05 
compared with control samples (Stk11*”~ mice), unpaired Student's 

t test. (K) Representative H&E staining of GI sections from aged Stk11*”— 
mice treated orally with vehicle control (n = 6) or AZD1480 (n = 3) for 

8 weeks. Scale bars, 5 mm. (L) Polyp size of vehicle- and AZD1480-treated 
Stk11*”— mice as in (K). Data represent the mean + SEM. *P < 0.05 
compared with control samples (vehicle-treated mice), Welch's t test. 

(M) Representative IHC for P-STAT3 (Y705), CD3 (T cell), and 
Iba-1 (macrophage) in GI sections from aged Stk11'S~ mice treated 

with vehicle control (n = 3) or AZD1480 (n = 3) in (K). Scale bars, 50 um. 


sufficient to stimulate STAT3 phosphoryl- 
ation (Fig. 4G) and promote increased expres- 
sion of a subset of STATS target genes, including 
Tlr2, in nontransformed MEFs (Fig. 4H and 
fig. SIIB). 

Last, we assessed the contribution of IL-6- 
STATS signaling to PJS polyp growth. GI polyp 
size was significantly reduced in Stk1I*/- mice 
lacking IL-6 expression (Fig. 4, I and J), similar 
to that observed in lymphocyte-deficient Stk1I*/- 
mice (Fig. 2, C and D). Polyps from Stk1I*/~;1l67”— 
mice still retained phospho-STAT3 (Y705) stain- 
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ing (fig. S11C), suggesting that other STAT3- 
activating factors such as IL-11 may be active in 
these tissues. Last, we inhibited STATS signaling 
in StkiI*”- mice in vivo using the JAK2 inhibitor 
AZD1480 (24, 25). AZD1480-treated Stk1I*”- mice 
displayed significant reductions in polyp size 
compared with that of controls (Fig. 4, K and 
L). AZD1480 treatment reduced phospho-STAT3 
levels and immune cell infiltration in GI tis- 
sues of Stk1I*/~ mice (Fig. 4M and fig. $12, A 
and B), whereas mTORCI activity was un- 
affected (fig. S12A). Thus, the blockade of STAT3 


27 July 2018 


signaling can affect polyp development in this 
PJS mouse model. 

The development of hamartomatous polyps in 
PJS patients has been attributed to cell-intrinsic 
tumor suppressor functions for StX1/ in epithelial 
and/or stromal tissue cells in the GI tract. Our 
findings establish immune-mediated inflamma- 
tion as a hallmark of PJS disease and highlight a 
critical role for Stk17 mutant T cells in PJS dis- 
ease progression. Our data argue for a more com- 
plex role for StkJ7 mutations in PJS disease 
development, with deregulated inflammatory 
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responses by LKB1 mutant immune cells, in ad- 
dition to epithelial and stromal tissues (26), re- 
inforcing tumor inflammation and chronic STAT3 
activation to drive polyp growth. Consistent with 
this, elevated IL-6 levels have been observed in 
LKB1-deficient tumors (27). Our data raise the 
possibility that inflammatory events in the GI 
tract (pathogen interactions with Stk1I*/~ im- 
mune cells) may trigger the inflammation we 
have found associated with PJS polyps in mice 
and humans to stimulate intestinal polyposis. 
Targeting chronic GI inflammation may present 
a novel approach to reducing disease incidence 
and polyp burden in PJS patients. 
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Mediator and RNA polymerase II 
clusters associate in transcription- 
dependent condensates 


Won-Ki Cho™, Jan-Hendrik Spille’*, Micca Hecht’, Choongman Lee’, Charles Li”*, 


Valentin Grube’, Ibrahim I. Cisse’+ 


Models of gene control have emerged from genetic and biochemical studies, with limited 
consideration of the spatial organization and dynamics of key components in living cells. We 
used live-cell superresolution and light-sheet imaging to study the organization and dynamics of 
the Mediator coactivator and RNA polymerase II (Pol II) directly. Mediator and Pol Il each 
form small transient and large stable clusters in living embryonic stem cells. Mediator and Pol Il 
are colocalized in the stable clusters, which associate with chromatin, have properties of 
phase-separated condensates, and are sensitive to transcriptional inhibitors. We suggest that 
large clusters of Mediator, recruited by transcription factors at large or clustered enhancer 
elements, interact with large Pol Il clusters in transcriptional condensates in vivo. 


conventional view of eukaryotic gene reg- 

ulation is that transcription factors, bound 

to enhancer DNA elements, recruit co- 

activators such as the Mediator complex, 

which is thought to interact with RNA 
polymerase II (Pol ID) at the promoter (7-5). This 
model is supported by a large body of molecular 
genetic and biochemical evidence, yet the direct 
interaction of Mediator and Pol II has not been 
observed and characterized in living cells (6). 
Using superresolution (7-9) and light-sheet imag- 
ing (10), we studied the organization and dy- 
namics of endogenous Mediator and Pol II in 
live mouse embryonic stem cells (mESCs). We 
directly tested whether Pol II and Mediator in- 
teract in a manner consistent with condensate 
formation (71-13), quantitatively characterized 
their biophysical properties, and considered the 
implications of these observations for transcrip- 
tion regulation in living mammalian cells. 

To visualize Mediator and Pol II in live cells, 
we generated mESC lines with endogenous 
Mediator and Pol II labeled with Dendra2, a 
green-to-red photoconvertible fluorescent pro- 
tein (materials and methods and figs. S1 and S2). 
We performed live-cell superresolution imaging 
and found that Mediator forms clusters (Fig. 1A 
and fig. S3) with a range of dynamic temporal 
signatures. Mediator exists in a population of 
transient small (~100 nm) clusters (Fig. 1B) with 
an average lifetime of 11.1 + 0.9 s (mean + SEM 
from 36 cells) (Fig. 1G), comparable to that of 
transient Pol II clusters observed in this study 
(Fig. 1, D, E, and H) and previously in differ- 
entiated cell types (14, 15). In addition, we ob- 
served that both Mediator and Pol II form a 
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population of large (>300 nm) clusters (~14: per 
cell), each comprising ~200 to 400 molecules, 
that are temporally stable (lasting the full ac- 
quisition window of the live-cell superresolution 
imaging) (Fig. 1, C and F to H, and figs. S4 to S6). 

We tested the extent to which these clusters 
depend on the stem cell state. The mESCs were 
subjected to a protocol (16) to differentiate them 


Fig. 1. Mediator and 
Pol II form transient 
and stable clusters in 
living mESCs. (A) A 
superresolution image of 
endogenous Mediator 
labeled with Dendra2 

in living mESCs. (B and 
C) Representative 
superresolved images of 
transient and stable 
Mediator clusters and 
corresponding time- 
correlated photoactivation 
ocalization microscopy 
tcPALM) traces. (B) and 
C) correspond to areas 
boxed in blue and yellow, 
respectively, in (A). 

D) Superresolution image 
of endogenous Pol II 
abeled with Dendra2 in 


endra2-Mediator 


into epiblastlike cells (EpiLCs) within 24 h (ma- 
terials and methods and fig. $7). Differentiation 
had no apparent effect on the population of tran- 
sient clusters, consistent with previous observa- 
tions that transient clusters persist in differentiated 
cell types (14, 15). However, both the size and 
the number of stable clusters decreased along 
the course of differentiation (fig. S8), suggesting 
that these stable clusters are prone to change as 
cells differentiate. 

We focused on the stable clusters of Mediator 
and Pol II and investigated whether they are 
colocalized. We generated mESCs with endog- 
enous Mediator and Pol II tagged with JF646- 
HaloTag (15, 17) and Dendra2, respectively 
(materials and methods and figs. S1 and S2). 
Direct imaging of both JF646-Mediator (Fig. 2A) 
and Dendra2-Pol II (Fig. 2B) showed bright spots 
of large accumulations in the nucleus, which 
corresponded to stable Pol II clusters accord- 
ing to subsequent superresolution imaging of 
Dendra2-Pol II in the same nuclei (Fig. 2C). 
The same observations were made with Dendra2- 
Mediator (fig. S9). Of 143 Mediator clusters imaged 
by dual-color light-sheet imaging (Fig. 2, D to F), 
129 (90%) had a colocalizing Pol II cluster (Fig. 2, 
G and H; materials and methods; and fig. S9). 
We conclude that these Mediator and Pol II clus- 
ters colocalize in live mESCs. 

Previous studies have shown that high den- 
sities of Mediator are located at enhancer clusters 
called super-enhancers (SEs) and that some are 
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Lifetime distributions of Mediator and Pol II clusters, respectively. Scale bars, 1 wm in (A) and (D) and 500 nm 


in (B), (C), (E), and (F). 
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A JF646- 
Mediator 


B  Dendra2- es 


Pol Il 


E Dendra2- 
Pol Il 


Dendra2-Polll 
4001 Mediator-JF646 


disrupted by loss of the BET (bromodomain 
and extraterminal family) protein BRD4, which 
is a cofactor associated with Mediator (18, 19). 
We found that treatment of mESCs with JQI, a 
drug that causes loss of BRD4 from enhancer 
chromatin, dissolved transient and stable clusters 
of both Mediator and Pol II clusters (Fig. 2, I to N, 
and fig. S10). 

After transcription initiation, Pol II transcribes 
a short distance (~100 base pairs ), pauses, and 
is released to continue elongation when phos- 
phorylated by CDK9 (20). We hypothesized that 
inhibition of CDK9 might selectively affect the Pol II 
stable clusters. We observed that upon incubation 
with DRB (5,6-dichloro-1-beta-p-ribofuranosyl- 
benzimidazole), Pol II stable clusters dissolved 
but Mediator stable clusters remained (Fig. 20). 
Quantification of Mediator-Pol II colocalization 
revealed that incubation with DRB progressively 
decreased the fraction of Mediator stable clusters 
that colocalized with Pol II (Fig. 2P). This effect 
could be reversed when DRB was washed out; 
the colocalization fraction recovered completely. 
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Fig. 2. Mediator and Pol II clusters colocalize in a transcription- 
dependent manner. Live-cell direct images of (A) JF646- 
(B) Dendra2—Pol Il. Yellow arrowheads indicate stable clusters. (C) Super- 
resolution image of Dendra2-Pol II overlaid with a background-subtracted 
JF646-Mediator image. Insets 1 to 5 show Mediator and Po 
clusters. (D) JF646-Mediator and (E) Dendra2—Pol II maximum-intensity 
projections of a fixed cell imaged by lattice light-sheet microscopy. (F) Overlay 
of background-subtracted images. Yellow arrowheads indicate clusters 
identified in the Dendra2—Pol II channel. (G) Scatter plot of the distance D from 
a Dendra2-Pol II cluster to the nearest JF646-Mediator cluster (n = 143 
clusters). Histograms outside the scatter plot show the distances along the 
X and y axes. (H) Same analysis for clusters identified in the Dendra2-Mediator 


channel (n = 67). (land J) Superresolution images of Dendra2-Mediator and 
Dendra2-Pol Il under normal conditions (left) and after 15 min (middle) or 
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These results imply that the association be- 
tween Mediator and Pol II clusters may be 
hierarchical, with upstream enhancer recruit- 
ment controlling both clusters but downstream 
transcription inhibition selectively affecting Pol II 
clusters. 

We characterized the long-term dynamics of 
stable clusters by using lattice light-sheet imag- 
ing in live mESCs (movies S1 and S2). We observed 
that clusters can merge upon contact (Fig. 3, A 
to D, and movies S1 and S2). The time scale of 
coalescence was very rapid, comparable to our 
full volumetric acquisition frame rate (15-s time 
interval). The added-up intensity of the two pre- 
cursor clusters was close to that of the newly 
merged cluster (Fig. 3E and fig. S11). These bio- 
physical dynamics are reminiscent of those of 
biomolecular condensates in vivo (21). 

In addition to coalescence, in vivo condensates 
had rapid turnover of the molecular components, 
as shown by fast recovery in fluorescence re- 
covery after photobleaching (FRAP) assays, and 
were sensitive to a nonspecific aliphatic alcohol, 
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6 hours (right) of incubation in 1 uM JQ1. (K and M) The number of transient 
Mediator and Pol II clusters per cell in a 2D focal plane as a function of time 
after JQ1 addition. (L and N) The number of stable Mediator and Pol II clusters 
per cell in a 2D focal plane. n = 17 to 25 cells and n = 14 to 24 cells at each 
JQ1 time point for Mediator and Pol Il, respectively. (0) DRB treatment 
and washout experiments. DRB (100 uM) was added at O min and washed 
away after 45 min. Arrowheads indicate stable clusters identified in the 
JF646-Mediator channel. Black arrowheads in the middle panel (bottom) 
indicate Mediator clusters that did not colocalize with Pol II clusters. 

(P) Ratio (top) and absolute number (bottom) of clusters detected in 

the Pol Il and Mediator channels per cell in a 2D foca 
15 cells were analyzed for each DRB incubation time point. The red arrow 
indicates the addition of DRB, and the blue arrow indicates DRB washout. 
Scale bars, 2 um in overview images and 200 nm in insets. 


plane. Nine to 


1,6-hexanediol (27). Our FRAP analyses of clus- 
ters revealed very rapid dynamics and turnover 
of their components: 60% of the Mediator and 
90% of Pol II components were exchanged within 
~10 s within clusters (Fig. 3, F to H). Moreover, 
the treatment of mESCs with 1,6-hexanediol 
resulted in the gradual dissolution of both Me- 
diator and Pol II clusters (Fig. 3, I to K, and fig. 
$12). Together, these results suggest that the stable 
clusters are in vivo condensates of Mediator 
and Pol II. 

We hypothesized that a phase separation 
model with induced condensation at the recruit- 
ment step of Mediator to enhancers would qual- 
itatively account for the observations in this 
study (22). The model implies that the conden- 
sates are chromatin associated and colocalize 
with enhancer-controlled active genes. We there- 
fore tested these two specific implications. We 
tracked the diffusion dynamics of Mediator clus- 
ters by computing their mean squared displace- 
ment as a function of time (n = 6 cells). On short 
time scales, the cluster motion was subdiffusive, 
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Fig. 3. Mediator and Pol II form condensates that 
coalesce, recover in FRAP, and are sensitive to hex- 
anediol. (A to E) Cluster fusion. (A) Maximum-intensity 
projection of a live cell imaged by lattice light-sheet microscopy. 
Trajectories of two clusters are indicated. (B and C) Clusters 
observed at O s and fusing at 369 s. (D) Individual time points 
around the fusion event. Orange and blue arrows indicate 

the precursor clusters, and the red arrows indicate the fused 
cluster. (E) Time course of the cluster intensities. a.u., arbitrary 
units. (F to H) FRAP analysis of clusters. (F) (Top) Images 

of a JF646-Mediator cell before (O s) (left), immediately after 
(1s) (middle), and 30 s after (right) bleaching. The red box 
indicates the position of the cluster on which the FRAP beam 
was focused. The blue box indicates an unbleached control 
locus. (Bottom) Cropped images as a function of time for both 
loci. (G) The normalized recovery curve for Mediator 

(n= 9 cells) yielded a recovery fraction of 60% during the 60-s 
observation, with a half-recovery time of 10 s. (H) FRAP 
analysis of JF646-Pol Il (n = 3 cells) yielded 90% recovery, 
with an identical half-recovery time of 10 s. (land J) Treatment 
with 10% hexanediol (v/v) gradually dissolved clusters of 
JF646-Mediator (I) and JF646—Pol II (J). Maximum-intensity 
projections of epifluorescence z stacks are shown. Yellow 
arrowheads indicate clusters identified at O min. Black arrow- 
heads indicate clusters that disappeared. (K) Average number 
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with an exponent o = 0.40 + 0.12 (best fit + SEM) 
(fig. S13). This is the same exponent found in 
the subdiffusional behavior of chromatin loci 
in eukaryotic cells (23-27). We also observed the 
same diffusional parameters when tracking a 
chromatin locus labeled by dCas9-based chi- 
meric array of guide RNA oligonucleotides 
(CARGO) in mESCs (fig. S13) (23). We concluded 
that clusters diffuse like chromatin-associated 
domains. 

We hypothesized that clusters were in close 
physical proximity to actively transcribed genes 
that can be visualized by global run-on nascent 
RNA labeling with ethynyl uridine (EU) (fig. S14). 
The run-on results showed that 2 min after DRB 
washout, virtually all Mediator clusters observed 
were proximal or overlapping with nascent RNA 
accumulations, as imaged by Click labeling of EU 
in fixed cells (fig. S14). We also employed the 
MS2 endogenous RNA labeling system (15, 28) 
(materials and methods and fig. S15) to investi- 
gate whether active transcription could be ob- 
served at Esrrb, one of the top SE-controlled 
genes in mESCs (29) (Fig. 4A). We observed 
bright foci consistent with nascent MS2-labeled 
gene loci and confirmed the gene loci by dual- 
color RNA fluorescence in situ hybridization 
(FISH) targeting the MS2 sequence and intronic 
regions of Esrrb (fig. S16). Intronic FISH on 125 
Esrrb loci from 82 fixed cells showed that 93% of 
Esrrb loci had a stable Mediator cluster nearby 
(within 1 um) but only ~22% of the loci co- 
localized with a stable Mediator cluster, suggest- 
ing that the Mediator-bound enhancer only 
occasionally colocalizes with the gene (fig. S17). 
The variability in colocalization may be explained 
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by a dynamic “kissing” model, where a distal 
Mediator cluster colocalizes with the gene only 
at certain time points (Fig. 4A). 

By dual-color three-dimensional (3D) live-cell 
imaging with lattice light-sheet microscopy, we 
found that some Mediator clusters were up to a 
micrometer away from the active Esrrb gene 
locus but in some instances directly colocalized 
with the gene (Fig. 4, B and C). In addition, we 
directly observed the dynamic interaction be- 
tween Mediator clusters and the gene locus, sup- 
porting the dynamic kissing model (Fig. 4, D and 
E; fig. S18; and movie S3). Tracking of loci in all 
six cells indicated that colocalization below our 
resolution limit of 300 nm occurred at ~30% of 
the time points (Fig. 4F). However, even when 
they were not overlapping, the Mediator cluster 
and the gene loci moved as a pair through the 
nucleus (movie S3), consistent with two adjacent 
regions anchoring to the same underlying chro- 
matin domain. We propose that Mediator clusters 
form at the Esrrb SE and then interact occa- 
sionally and transiently with the transcription 
apparatus at the Esrrb promoter. 

We have found that Mediator and Pol II form 
large stable clusters in living cells and have shown 
that these clusters have properties expected for 
biomolecular condensates. The condensate prop- 
erties were evident through coalescence, rapid 
recovery in FRAP analysis, and sensitivity to 
hexanediol. In a model of phase separation on 
the basis of scaffold-client relationships (30), it 
is possible that enhancer-associated Mediator 
forms a condensate and provides a “scaffold” 
for “client” RNA Pol II molecules. The model 
we propose whereby large Mediator clusters at 
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Fig. 4. Mediator clusters dynamically kiss 
actively transcribing SE-controlled genes. 
(A) Illustration of the working model describing 
cluster-kissing interaction with a gene locus. 
(B) Maximum-intensity projection of a cell imaged 
by using lattice light-sheet microscopy showing 
colocalization of a JF646-Mediator cluster with the 
actively transcribing Esrrb gene locus marked by 
MS2-tagged RNA (white box). (C) Single plane 
from the z stack after background subtraction. 
D) Snapshot images of a Mediator cluster near 
the actively transcribing Essrb gene locus. Time 
points listed in (B) to (D) indicate minutes after 
the start of acquisition. Scale bars, 2 um in (B) 
and (C) and 500 nm in (D). (E) Plot of the 
centroid-to-centroid distance from the gene 
ocus to the nearest cluster as a function of time. 
F) Cumulative distribution of distances from the 
Esrrb locus to the nearest Mediator cluster pooled 
from six cells (291 time points). The red dashed line 
in (E) and (F) indicates the colocalization threshold 
(300 nm). CDF, cumulative distribution function. 
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enhancers transiently kiss the transcription ap- 
paratus at promoters has a number of implica- 
tions for gene control mechanisms. The presence 
of large Mediator clusters at some enhancers 
may allow Mediator condensates to contact the 
transcription apparatus at multiple gene pro- 
moters simultaneously. The large size of the 
Mediator clusters may also mean that the ef- 
fective distance of the enhancer-promoter DNA 
elements can be in the same order as the size of 
the clusters (>300 nm), larger than the distance 
requirement for direct contact. We speculate that 
such clusters may help explain gaps of hundreds 
of nanometers that are found in previous studies 
measuring distances between functional enhancer- 
promoter DNA elements. Such cluster sizes also 
imply that some long-range interactions could go 
undetected in DNA interaction assays that de- 
pend on much closer physical proximity of en- 
hancer and promoter DNA elements. 
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every week on ScienceCareers.org are Ph.D.s 


Once their position ends, postdocs come to a fork in the road—to 
stay or leave research. This special must-read feature focuses on 
a path less traveled—careers in career development. Some work in 
Postdoc Affairs or Graduate Affairs divisions and departments or 
Careers Services organizations. If you're hiring, be sure to promote 


yheciosuncct your job opportunities to Science's highly qualified readership of 
advertise@sciencecareers.org grad students, postdocs and researchers. 


The Americas What makes Science the best choice for recruiting? 

PeO2 See GSiT # Read and respected by 400,000 readers around the globe 

Europe - 0 : ‘ 

+44 (0) 1223 326527 80% of readers read Science more often than any other journal 

Japan * Your ad dollars support AAAS and its programs, which 

+813 6459 4174 strengthens the global scientific community. 

China/Korea/Singapore/ 

Taiwan Why choose this Postdoc Feature for your 

+86 131 4114 0012 advertisement? 

= Relevant ads lead off the career section with bn 

Produced by the Science/AAAS a special “Postdoc” banner Science 


Custom Publishing Office. 
= Link on the job board homepage directly 


to postdoc positions 


=" Dedicated landing page for postdoc 
positions. 


Science Careers 


SCIENCECAREERS.ORG FROM THE JOURNAL SCIENCE JAVAAAS 


FOR RECRUITMENT IN SCIENCE, THERE’S ONLY ONE SCIENCE. 
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. Department of Pharmacology 
' Yale University 


Assistant or Associate Professor of Pharmacology 


The Department of Pharmacology at Yale University School of Medicine 
invites applications for Assistant or Associate Professor positions in the tenure 
track. Applicants must have an M.D. and/or Ph.D. or equivalent degree and 
have demonstrated excellent qualifications in research and education. Areas 
of interest include studies of molecular function such as proteomics, chemical 
biology, structural biology (including electron microscopy), and computational 
biology in different areas of drug discovery, cell signaling, metabolism, genetic 
diseases, and epigenetic mechanisms. 


Responsibilities include establishing a vigorous and independently funded 
research program in cancer biology while supervising and mentoring students 
with diverse backgrounds plus contributing to the graduate and medical school 
educational missions. We seek individuals with strong records of independent 
creative accomplishments, who will interact productively with colleagues within 
the Department and across Yale School of Medicine. 


Pharmacology at Yale University has a rich history of pre-eminence in drug 
discovery, indeed the first cancer chemotherapy clinical trials were conducted 
here. Today, the research in our Department encompasses a broad range of 
fields, with researchers aiming to both understand and treat human disease, 
with particular strengths in drug discovery, integrative cell signaling, structural 
biology, and neuroscience. For more details, see https://medicine.yale.edu/ 
pharm/ 


Please submit a letter describing qualifications, along with a CV, a two-page 
summary of current and proposed research, and three letters of reference to 
the following Interfolio website: http://apply.interfolio.com/52275. Informal 
inquiries may be submitted electronically to Prof. Joseph Schlessinger, Chair of 
Pharmacology, c/o Nathan Kucera (nathan.kucera@yale.edu). Consideration 
of applications will begin October 1*, 2018. 


Yale University is an Equal Opportunity/Affirmative Action Employer. We 
seek candidates who embrace and reflect diversity in the broadest sense. 
Yale values diversity among its students, staff, and faculty and strongly 
welcomes applications from women, persons with disabilities, protected 
veterans, and underrepresented minorities. 


Jefferson Science Fellowship 


" FROM THE AMERICAN PEOPLE 


The National Academies of Sciences, Engineering, and Medicine 
is pleased to announce a call for applications for the 2019 Jefferson 
Science Fellows (JSF) program. Initiated by the Secretary of State 
in 2003, this fellowship program engages the American academic 
science, technology, engineering and medical communities in the 
design and implementation of U.S. foreign policy. 


Jefferson Science Fellows spend one year at the U.S. Department of 
State or the U.S. Agency for International Development (USAID) 
for an on-site assignment in Washington, D.C. that may also involve 
extended stays at U.S. foreign embassies and/or missions. 


The fellowship is open to tenured, or similarly ranked, academic 
scientists, engineers, and physicians from U.S. institutions of higher 
learning. Applicants must hold U.S. citizenship and will be required 
to obtain a security clearance. 


The deadline for applications for the 2019-2020 program year 
is October 31, 2018. To learn more about the Jefferson Science 
Fellows program and to apply, visit www.nas.edu/jsf. 


The Jefferson Science Fellows program is administered by the 
National Academies of Sciences, Engineering, and Medicine and 
supported by the U.S. Department of State and the United States 

Agency for International Development. 


The National Academies of 


SCIENCES * ENGINEERING * MEDICINE 


Advance your 
career with expert 
advice from 

Science Careers. 


(3) Download Free Career Advice Booklets! 
ScienceCareers.org/booklets 
Featured Topics: 
= Networking 
= Industry or Academia 
= Job Searching 
= Non-Bench Careers 
= And More 


Science Careers 
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-e Cleveland Clinic 


Assistant or Associate Professor 
Genitourinary Malignancy Research Center 


The newly formed Genitourinary Malignancy Research Center at 
Cleveland Clinic is seeking to recruit a physician-scientist or scientist 
with a primary interest in prostate, kidney or bladder cancer. The position 
presents an exceptional opportunity to partner with outstanding clinical 
urology and oncology programs in a new center where a principal goal 
is to translate basic discoveries rapidly to the clinic. Cleveland Clinic is 
ranked the #1 hospital for urology and #2 hospital overall by US News 
and World Report. The overarching goal of the Center is to define the 
underlying biochemical and clinical behavior of prostate, kidney, and 
bladder cancer in order to identify new strategies to prevent symptoms 
and death. 


Applicants must have an M.D., M.D./Ph.D., or Ph.D., an exceptional 
track record of accomplishment, and be able to function well as part of 
a multidisciplinary team. The successful applicant will be supported by 
generous start-up funds and joint appointments in Cleveland Clinic’s 
Lerner Research Institute, Glickman Urological and Kidney Institute and 
Taussig Cancer Institute (part of the NCI-designated Case Comprehensive 
Cancer Center). 


Candidates should submit a CV, summary of research interests, and three 
references, via e-mail to Nima Sharifi, M.D. c/o Kali Casale, casalek@ 
cef.org 


Cleveland Clinic is pleased to be an Equal Employment/Affirmative 
Action Employer: Women/Minorities/Veterans/Individuals with 
Disabilities. Smoke/drug-free environment. 


online @sciencecareers.org 
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WORKING LIFE 


By Vanessa Sung 


422 


Let your stars shine 


few months ago, one of my fellow graduate students was offered an incredible career-building 
opportunity. His efforts on a student-led campaign to increase federal funding for basic research 
had led to an invitation to speak on a conference panel about health research advocacy. He would 
have a prominent platform to share his work with a large audience, and he would get to meet and 
network with leaders in the field. It would look great on his CV. I was happy for him, of course; 
he deserved it. But my feelings were complicated. I had been an equal partner in the campaign. 
I felt that I deserved the same opportunity. Yet I hadn’t been invited, and I wasn’t sure why. 


Not long ago, I wouldn’t have done 
anything about it. Growing up, I 
had been taught that the way to 
success was to put my head down 
and work hard. If my work was 
good enough, it would speak for 
itself and I would be rewarded. 
This, coupled with a natural shy- 
ness, meant that I rarely worked up 
the nerve to put myself forward. 

Then last year happened. My 
long-term relationship was in 
shambles. I was in the midst of a 
significant health scare and _ bat- 
tling doubts about where my career 
was headed. When the dust settled, I 
decided that I needed to take control 
of my life again—by becoming my 
own biggest advocate. 

So, with my heart pounding, I 
asked my colleague to ask the panel 
organizers to include me. My mind 
flooded with a litany of well-practiced self-doubt. Maybe 
I didn’t deserve it. Maybe I was being too ambitious. But 
my colleague immediately contacted the organizers, who 
quickly agreed to add me to the panel. 

I’ve come to understand that I can’t be recognized or 
rewarded for my work if I’m invisible. This isn’t a revolu- 
tionary idea, I know, but as an introvert this is my personal 
Everest. The biggest lesson I’ve learned is simply to ask for 
what you want. Ask to be nominated for an award. Ask to 
give a guest lecture. Ask for a reference letter from that 
person you think is beyond your reach. The worst that can 
happen is that the answer is no, but I’ve found that people 
are generally happy to oblige. 

I am also working to get more comfortable promoting 
my accomplishments. A couple of years ago, when a mentor 
agreed to write a recommendation letter for me but asked 
me to provide a draft as a starting point, I agonized over it. 
Writing nice things about myself felt completely unnatural. 
With every complimentary statement I committed to paper 


“T can’t be recognized or 
rewarded for my work 
if I’m invisible.” 


came mounting anxiety that my 
mentor would disapprove of my lack 
of humility. I never got any feedback 
about that letter, so I don’t know for 
sure what my mentor thought of it. 
Nonetheless, I learned an important 
lesson: I can’t control what other 
people think, so I might as well 
share my accomplishments as best I 
can. These days, whenever I ask for 
a recommendation letter, I offer to 
provide a draft. I know best how to 
highlight my skills and accomplish- 
ments for a particular application, 
and it doesn’t hurt to offer. 

I’ve also embraced social media 
as a way to share my accomplish- 
ments more widely. I was hesitant 
at first, worried that I might cross 
the line from self-promotion to 
bragging. But when I asked myself 
whether I was turned off by other 
scientists sharing their publications, awards, projects, and 
achievements, the answer was clear: nope. In fact, their 
self-promotion helped me discover amazing work and find 
new professional connections. Last year, for example, I 
was intrigued to see two young female scientists promot- 
ing their new podcast on Twitter. I commented on their 
post, suggesting science policy as an episode topic, and 
they ended up interviewing me on their show! This unique 
opportunity would never have materialized without a little 
online self-promotion on both our ends. 

I think advocating for myself will always make me a little 
uncomfortable. But I do plenty of other uncomfortable things 
to ensure I’m living my best life, like getting flu shots and (ugh) 
going to the gym. I suppose I can do the same for my career. 


Vanessa Sung is a Ph.D. candidate at McGill University 
in Montreal, Canada, and co-president of Science & Policy 
Exchange. Do you have an interesting career story? 

Send it to SciCareerEditor @aaas.org. 
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